DESCRIPTION 



NOVEL EXPRESSION VECTOR SUITABLE FOR EXPRESSION OF RECOMBINANT 

PROTEIN AT LOW TEMPERATURE ^ 

Technical Field 

The present invention relates to an expression vector which can inducibly express a 
foreign gene in a bacterium of genus Rhodococcus. 

The present invention also relates to an inducible expression vector which can express a 
recombinant protein in a host cell at low temperature and a method for expressing a 
recombinant protein at low temperature using this vector. Furthermore, the present invention 
relates to an inducible expression vector for a bacterium of genus Rhodococcus which vector 
can inducibly express a gene encoding a protein whose expression product inhibits 
proliferation of the host cell imder medium to high temperature conditions exceeding about 
15°C in a bacteriiun of genus Rhodococcus which can proliferate under low temperature 
conditions and a method for expressing a recombinant protein whose expression product 
inhibits proliferation of the host cell imder medium to high temperature conditions exceeding 
about 15°C at low temperature using a bacterium of genus Rhodococcus which contains this 
vector and can proliferate under low temperature conditions. 

Background Art 

Currently, in order to prepare in mass a protein derived from a eukaryote as a 
recombinant, expression systems using E. colt as a host are widely used (Weickert et al., Curr. 
Opin. Biotechnol. 7 494-499 (1996); Baneyx, Curr. Opin. Biotechnol. 10 41 1-421 (1999)). E, 
coli is a mesophile, and although it grows at 18 to 37°C, culturing temperature for expressing 
the recombinant protein should also be within the above-mentioned temperature range. 
However, since the protein derived from a eukaryote shows activity in the same temperature 
range, some proteins, when made to express in E. coli as a recombinant, may inhibit the 
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growth of E. coll, and consequently, significant quantity of recombinant protein may not be 
obtained. 

Besides those using E. coli, expression systems using eukaryotic cells such as 
Saccharomyces cerevisiae and Pichia pastoris (Cereghino and Cregg, Curt. Opin. Biotechnol. 
10 422-427 (1999)), Sf9 cell (Miller, Curr. Opin. Genet. Dev. 3 97-101 (1993)) as a host are 
known but these systems cannot express the protein efficiently either unless the culturing 
temperature is around 30°C, and the production thereof may be difficult for the similar reason. 
For example, when producing a foreign protein using insect cell Sf9 usually used for the 
production of recombinant protein, the optimal temperature for the production is about 28'»C, 
and the lowest temperature is about 18°C (Agathos et al., Ann. N. Y. Acad. Sci. 589 372-398 
(1990), Faber et al.. Yeast H 1331-1344 (1995)). In the meantime, when producing a 
foreign protein using yeast {Pichia pastoris), the optimal temperature for the production is 
about 30*'C, and the lowest temperature is about IS^C (Brock et al., J. Membr. Biol. 180 
147-161 (2001); Sarramegna et al., Protein Expr. Purif. 24 212-220 (2002)). That is, suitable 
.growth temperature range of the insect cell Sf9 is about 18°C or higher, and the suitable 
growth temperature range of the yeast is about 1 5°C or higher. In addition, when these cells 
are used as a host, some proteins may be modified with oligosaccharide etc., which causes 
inconvenience in the subsequent fimctional analyses such as 3D structure analysis. 

Disclosure of the Invention 

An object of the present invention is to express a protein which carmot be expressed in 
the other recombinant protein expression systems including E. coli expression systems. For 
example, the present invention is directed to express a recombinant protein at low temperature 
which protein cannot be expressed under medium to high temperature conditions exceeding 
15°C in a recombinant host cell such as E. coli. 

Another object of the present invention is to inductively express a foreign recombinant 
protein using a bacterium of genus Rhodococcus. 

In order to solve the above-mentioned problem, it is considered to be effective to make 
it expressed at low temperature to suppress the activity of the recombinant protein. In E. coli, 
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an expression system using a low-temperature inducible promoter at 15 to 16''C is an example 
which produced a recombinant protein at the lowest temperature (JP Patent Publication 
(Kohyo) NO.10-503090A (1998), Mujacic et al.. Gene 238 325-332 (1999)). In addition, 
production of a recombinant protein at 15 to 18°C is production of a recombinant protein at 
the lowest temperature known ever for insect cell and yeast as mentioned above. Therefore, 
it was considered that to make the recombinant protein to be expressed at or lower than 15°C 
to 18°C, the lowest temperatures at which recombinant proteins can be expressed using 
conventionally known host cells express, preferably around 4°C, was effective. However, 
since the growth is difficult and protein production is impossible at or lower than \5°C, 
particularly around 4°C for any of the above-mentioned host cells, use of an expression system 
employing a bacterium which can grow around 4°C as a host is considered to be suitable. 
Accordingly, the present inventors have tried to solve the problems by establishing an 
inducible expression vector using a bacterium of genus Rhodococcus as a host and capable of 
expressing a foreign protein in a wide temperature range (from 4°C to around 32°C). 

Rhodococcus erythropolis (Larkin et al., Antonie van Leeuwenhoek 74 133-153 
(1998)) is an actinomycete growing in a wide temperature range from A^C to 35°C and easy to 
be studied in biogenetics since a composite vector capable of autonomous replication in both 
cell species of this bacterium and E. coli (De Mot et al.. Microbiology 143 3137-3147 (1997)) 
has been developed. 

In addition, as for the bacteria of genus Rhodococcus in general, composite vectors for 
E. coli have been developed (JP Patent Publication (Kokai) No.5-64589A (1993), JP Patent 
Publication (Kokai) No.8-56669A (1996)), and a general-purpose expression vector which 
constitutively expresses a foreign gene also exists (JP Patent Publication (Kokai) 
No.l0-248578A(1998)), 

However, in order to make the protein expressed rapidly at low temperature with 
efficiency, development of an inducible expression vector which easily enables strict and 
powerful regulation of expression of the protein is essential. That is, cells are allowed to 
proliferate at 30°C with the expression suppressed first and then the temperature is lowered to 
4'*C to induce expression. To date, however, there has been no report of such an inducible 

3 



expression vector for this bacillus, and it was considered effective to divert an inducible 
expression system derived from those for other kinds of bacteria. 

Streptomyces coelicolor is a kind of actinomycete as well as Rhodococcus erythropolis, 
and a series of gene clusters whose expression is inducible by the addition of an antibiotic 
thiostrepton are known for this bacterium (Murakami et al, J. Bacteriol. 171 1459-1466 
(1 989)). It has been known that TipA gene, one of these genes, encodes a protein consisted of 
253 amino acids, and this TipA protem covalently binds to thiostrepton, and acts on its own 
promoter domain as a TipA-thiostrepton complex to powerfully promote the transfer from its 
own structural gene (Hohnes et al., EMBO J. 12 3183-3191 (1993), Chiu et al.. Biochemistry 
35 2332-2341 (1996)). In addition, an inducible expression vector using the TipA structural 
gene and TipA gene promoter was also developed, and there is an example which made a 
foreign protein expressed within genus Streptomyces (Enguita et al., FEMS Microbiol. Lett. 
137 135-140 (1996)). It is expected that if a vector having a gene cluster comprised of TipA 
structural gene and a structural gene of the target protein linked downstream to the TipA gene 
promoter introduced is constructed in Rhodococcus erythropolis as well, the vector can be an 
inducible expression vector in the same manner as in the case of a bacterium of genus 
Streptomyces. There has been, however, no such a report. 

Moreover, if the production of a recombinant protein is enabled at low temperature 
about 15°C or lower, particularly at 4°C, it will be considered that not only the protein which 
inhibits the proliferation of a host is enabled but also the following advantages will be 
obtained. 

When E. coli is made to express a recombinant protein at 37°C, it may form an 
aggregated inactive protein called mclusion body. Many examples are known, however, that 
even the same protein is produced as an active and soluble protein when the expression 
temperature is lowered to below 30°C (Schein and Notebom, Bio/Technology 6 291-294 
(1988); Piatak et al., J. Biol. Chem. 263 4837-4843 (1988); Schirano and Shibata, FEBS Lett. 
271 128-130 (1990); Vasnia and Baneyx, Protein Expr. Purif. 9 211-218 (1997); Lin et al.. 
Protein Expr. Purif. 1 169-176 (1990)). Therefore, if an expression system is constructed at 
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low temperature of about IS'^C or lower, particularly around 4°C, this problem requiring 
solubilization is considered to be also solved. 

Furthermore, a low temperature of about 15°C or lower, particularly around 4°C is 
supposed to be also favorable for the production of proteins derived from psychrotrophic 
bacteria whose optimal growth temperature range is 20°C or lower, heterothermic animals 
living under a low-temperature environment and plants living under a low-temperature 
environment. This is because it is considered that these proteins are not expressed as active 
proteins when the temperature is too high. Although there is only one example in which 
a-amyl£ise derived from a psychrotrophic bacterium is expressed using a psychrotrophic 
bacterium as a host (Tutino at al., Extremophiles 5 257-264 (2001)), the case was not with an 
expression inducible vector and it is considered to be difficult to quickly produce the protein in 
mass. 

Thus the present inventors have intensively studied on the construction of an 
expression vector which can inducibly express a foreign protein in a bacterium of genus 
Rhodococcus and an expression vector which can inducibly express a foreign protein at low 
temperature about 15°C or lower, and have completed the present invention. 

That is, the present invention is as follows. 

(1) An expression vector which can inducibly express a foreign gene with an inducing 
substance in a host cell and can be expressed at a temperature below the suitable growth 
temperature r£inge of a host other than said host. 

(2) An expression vector which can inducibly express a foreign gene with an inducing 
substance in a host cell and can be expressed at a temperature of 15°C or lower. 

(3) The expression vector according to (1) or (2) which can be expressed at 4®C. 

(4) The expression vector according to any one of (1) to (3) wherein the host cell is a 
bacterium of genus Rhodococcus. 

(5) The expression vector according to (4) wherein the bacterium of genus Rhodococcus is 
selected from the group consisting of R,erythropolis^ RJascians and R.opacus. 

(6) The expression vector according to any one of (1) to (5) wherein the inducing 
substance is thiostrepton. 
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(7) The expression vector according to any one of (1) to (6) wherein the foreign gene 
encodes a protein which inhibits proliferation of the host cell under medium to high 

temperature conditions exceeding 15°C, 

(8) The expression vector according to any one of (1) to (7) wherein the vector comprises a 
promoter sequence whose expression can be regulated by an inducing substance and a 
multicloning site to which a foreign gene can be introduced. 

(9) A transformant comprising an expression vector according to any one of (1) to (8). 

(10) A method for producing protein using an expression vector according to any one of (1) 
to (8). 

(11) An inducible expression vector which can inducibly express a gene encoding a protein 
which inhibits proliferation of the host cell when it is expressed at a temperature within the 
range of suitable growth temperature of the host cell in another host cell having a suitable 
growth temperature range lower than the suitable growth temperature of said host cell using an 
inducing substance. 

(12) The expression vector according to (1 1) which can be expressed at 4°C. 

(13) The expression vector according to (1 1) or (12) wherein the host cell is a bacterium of 
genus Rhodococcus. 

(14) The expression vector according to (13) wherein the bacterium of genus Rhodococcus 
is selected from the group consisting of R.erythropolis, R/ascians and R.opacus, 

(15) The expression vector according to any one of (11) to (14) wherein the inducing 
substance is thiostrepton. 

(16) The expression vector according to any one of (11) to (15) wherein the vector 
comprises a promoter sequence whose expression can be regulated by an inducing substance 
and a multicloning site to which a foreign gene can be introduced. 

(17) A transformant comprising an expression vector according to any one of (1 1) to (16). 

(18) A method for producing a protein using an expression vector according to any one of 
(11) to (16). 

(19) An expression vector which can inducibly express a foreign gene using an inducing 
substance in a bacterium of genus Rhodococcus. 

6 



(20) The expression vector according to (19) wherein the bacterium of genus Rhodococcus 
is selected from the group consisting of R.erythropolis, RJascians and R.opacus. 

(21) The expression vector according to (19) or (20) wherein the inducing substance is 
thiostrepton. 

(22) An expression vector according to any one of (19) to (21) wherein the vector comprises 
an expression cassette comprising a TipA gene promoter sequence, the first muhicloning site to 
which a foreign gene can be introduced and a transcription termination sequence, an inducer 
cassette comprising the second promoter sequence and TipA gene, a DNA region essential to 
autonomous replication of a plasmid for a bacterium of genus Rhodococcus and a thiostrepton 
resistance gene. 

(23) A transformant of a bacterium of genus Rhodococcus comprising an expression vector 
according to any one of (19) to (22). 

(24) A method for producing a protein using an expression vector according to any one of 
(19) to (22). 

(25) An inducible expression vector for a bacterium of genus Rhodococcus which can 
inducibly express a gene encoding a protein whose expression product inhibits proliferation of 
the host cell under medium to high temperature conditions exceeding IS^'C in a bacterium of 
genus Rhodococcus which can proliferate imder low-temperature conditions. 

(26) An inducible expression vector for a bacterium of genus Rhodococcus whose 
expression can be induced in a bacterium of genus Rhodococcus in which a foreign gene can 
be multiplied under low-temperature conditions, wherein the vector comprises an expression 
cassette comprising a TipA gene promoter sequence, the first multicloning site to which a 
foreign gene can be introduced and a transcription termination sequence, an inducer cassette 
comprising the second promoter sequence and TipA gene, a DNA region essential to 
autonomous replication of a plasmid for a bacterium of genus Rhodococcus and a thiostrepton 
resistance gene. 

(27) The inducible expression vector for a bacterium of genus Rhodococcus according to 
(26) wherein the vector further comprises a DNA region essential to autonomous replication of 
a plasmid for E. coli and can be reproduced in E. coli. 



(28) The inducible expression vector for a bacterixim of genus Rhodococcus according to 
(26) or (27) wherein the TipA gene promoter is TipA-LGlO promoter. 

(29) The inducible expression vector for a bacterium of genus Rhodococcus according to 
any one of (26) to (28) wherein the vector is selected from the group consisting of pTip-NHl 
having a nucleotide sequence represented by SEQ ID No. 106, pTip-NH2 having a nucleotide 
sequence represented by SEQ ID No. 107, pTip-CHl having a nucleotide sequence 
represented by SEQ ID No. 108, pTip-CH2 having a nucleotide sequence represented by SEQ 
ID No. 109, pTip-LNHl having a nucleotide sequence represented by SEQ ID No. 110, 
pTip-LNH2 having a nucleotide sequence represented by SEQ ID No. Ill, pTip-LCHl having 
a nucleotide sequence represented by SEQ ID No. 112, pTip-LCH2 having a nucleotide 
sequence represented by SEQ ID No. 113, pTip-CHl.l, pTip-CH2.1, pTip-LCHl.l and 
pTip-LCH2.1. 

(30) The inducible expression vector for a bacterium of genus Rhodococcus according to 
any one of (25) to (29) wherein the bacterium of genus Rhodococcus is selected from the 
group consisting of Rerythropolis, R/ascians and R.opacus. 

(31) A transformant of a bacterium of genus Rhodococcus comprising an inducible 
expression vector for a bacterium of genus Rhodococcus according to any one of (25) to (30). 

(32) A process for producing a protein whose expression product inhibits proliferation of the 
host cell under medium to high temperature conditions exceeding 15°C wherein the process 
comprises introducing an inducible expression vector for a bacterium of genus Rhodococcus 
according to any one of (25) to (30) comprising as a foreign gene a gene encoding a protein 
difficult to be expressed under medium to high temperature conditions exceeding 15**C into a 
bacterium of genus Rhodococcus which can proliferate at low temperature; and culturing the 
above-mentioned bacteria into which an inducible expression vector for a bacterium of genus 
Rhodococcus is introduced using a culture medium comprising thiostrepton under 
low-temperature. 

(33) The process for producing a protein at low temperature according to (32) wherein the 
protein difficult to be expressed under medium to high temperature conditions exceeding 15°C 
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is a protein which inhibits proliferation of flie host cell under medium to high temperature 
conditions exceeding 15°C. 

(34) The process for producing a protein at low temperature according to (32) wherein the 
protein difficult to be expressed under medium to high temperature conditions exceeding 15°C 
is a protein which forms an inactive inclusion body in E. coli when it is made expressed imder 
medium to high temperature conditions exceeding 15**C. 

(35) A process for producing a protein derived fi-om psychrotrophic bacteria, heterothermic 
animals or plants living under a low-temperature environment wherein the process comprises 
introducing an inducible expression vector for a bacterium of genus Rhodococcus according to 
any one of (25) to (30) comprising a gene encoding a protein derived from psychrotrophic 
bacteria, heterothermic animals or plants living imder a low-temperature environment into a 
bacterium of genus Rhodococcus which can proliferate at low temperature; and culturing the 
above-mentioned bacteria into which an inducible expression vector for a bacterium of genus 
Rhodococcus is introduced using a culture medium comprising thiostrepton under 
low-temperature. 

(36) A process for screening a protein difficult to be expressed under medium to high 
temperature conditions exceeding 15*'C wherein the process comprises introducing an 
inducible expression vector for a bacterium of genus Rhodococcus according to any one of 
(25) to (30) comprising a foreign gene into a bacterium of genus Rhodococcus which can 
proliferate at low temperature; culturing the above-mentioned bacteria into which an inducible 
expression vector for a bacterium of genus Rhodococcus is introduced using a culture medium 
comprising thiostrepton under medium to high temperature conditions exceeding IS^'C and 
under low-temperature condition; and selecting a gene which can be expressed only under 
low-temperature condition equal to or below 15**C. 

(37) The process for screening a protein difficult to be expressed under medium to high 
temperature conditions exceeding 15*^C according to (36) wherein the protein difficult to be 
expressed under medium to high temperature conditions exceeding 15**C is a protein which 
inhibits proliferation of the host cell under medium to high temperature conditions exceeding 
15°C. 
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(38) A process for screening a protein difficult to be expressed under medium to high 
temperature conditions exceeding 15°C wherein the process comprises selecting a gene which 
does not express or inhibits proliferation of E. coli when it is introduced and made expressed 
under medium to high temperature conditions exceeding 15*^C; and then introducing an 
inducible expression vector for a bacterium of genus Rhodococcus according to any one of 
(25) to (30) comprising said gene as a foreign gene into a bacterium of genus Rhodococcus 
which can proliferate at low temperature; and selecting a gene which can be expressed when 
cultured under low-temperature condition using a culture medium comprising thiostrepton. 

(39) The process for screening a protein according to (38) wherein the protein difficult to be 
expressed under medium to high temperature conditions exceeding 15**C is a protein which 
inhibits proliferation of E, coli at 30°C or higher. 

(40) The process for screening a protein according to (38) wherein the protein difficult to be 
expressed under medium to high temperature conditions exceeding 15°C is a protein which 
forms an inclusion body when it is made expressed under medium to high temperature 
conditions exceeding 15°C. 

(41) The process for screening a protein according to (38) wherein the protein diflScult to be 
expressed imder medium to high temperature conditions exceeding 15®C is a protein which 
inhibits proliferation of the host cell under medium to high temperature conditions exceeding 
15^C. 

(42) A protein difficult to be expressed under medium to high temperature conditions 
exceeding 15®C obtained by a screening process according to any one of (36) to (41). 

Hereafter, the present invention is described in detail. 
1. Construction of expression vector of the present invention 

The expression vector of the present invention is capable of autonomous replication in 
a cell which can proliferate at low temperature, and which can inducibly express a foreign 
gene incorporated into the vector, i.e., an inducible expression vector. 

The cell which can proliferate at low temperature is not limited, as long as it is a cell 
which can proliferate at low temperature, and any microbes such as E. coli and yeast, insect 
cells, mammals cells, etc. can be used. In view of secure proliferation at low temperature, 
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bacteria belonging to genus Rhodococcus, preferably R.erythropolis, Rfascians, R.opacus, etc. 
are included. Among these three kinds of a bacterium of genus Rhodococcus, R.erythropolis 
has a highest proliferation rate at and the other two are inferior thereto. However, in the 
production of a protein using a vector of the present invention, cells can be proliferated at a 
temperature suitable for proliferation and then transferred to low temperature condition so that 
the protein can be inductively expressed and produced. Accordingly, as long as the cells can 
express and produce a foreign protein at 4''C, proliferation rate does not matter and all of the 
three kinds species belonging to genus Rhodococcus, i.e., R.erythropolis^ RJascians^ and 
genus Rhodococcus of R.opacus can be suitably used. 

Low temperature refers to a temperature lower than the optimal proliferation 
temperature of usual bacteria, and means a temperature from 4'*C to 18°C, preferably from 
4'*C of IS^'C, particularly preferably around 4'*C. Although the suitable growth temperature 
range of usual bacteria can vary depending on the kind, it is about 15°C to about 40''C, or 
about IS^'C to about 40''C, and temperature exceeding about 15°C is referred to as medium to 
high temperature in this specification. 

A foreign gene is a gene encoding the target protein to be expressed and produced 
using a vector of the present invention, and a gene encoding a protein derived from an 
organism other than the host cell. The foreign gene to be incorporated into a vector of the 
present invention is a gene encoding a protein which is difficult or impossible to be expressed 
at medium to high temperatures exceeding about IS^'C. The protein which is difiScult or 
impossible to be expressed at high temperature exceeding about 15°C means a protein which 
expresses with low efficiency or does not express at all when it is attempted to be expressed at 
medium to high temperatures exceeding about IS^'C. Examples of the proteins include genes 
encoding proteins such as a protein which cannot be expressed in the range of optimal growth 
temperature of the host cell but can be expressed at a temperature lower than the suitable 
growth temperature of the microbe when the identical or different kind of a microbe is used as 
the host cell; a protein which is lethal to the host cell when it is expressed at a temperature 
within the suitable growth temperature range of the host microbe but not lethal to the identical 
or different kind of microbe at a temperature lower than the suitable growth temperature of the 
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microbe; a protein which inhibits proliferation of the host cell when it is expressed at a 
temperature within the suitable growth temperature range of the host cell but does not inhibit 
proliferation of the host cell at a temperature lower than the suitable growth temperature of the 
identical or different kind of host cell; a protein which forms an inactive protein aggregation 
called inclusion body when it is expressed at a temperature within the suitable growth 
temperature range of the host cell but becomes an active and soluble protein when expressed 
in the host cell at a temperature lower than the suitable growth temperature of the identical or 
different kind of host cell; a protein derived from psychrotrophic bacteria whose suitable 
growth temperature range is 20°C or lower, heterothermic animals living under 
low-temperature environment and plants living under low-temperature environment etc. 

When a particular gene is made expressed at medium to high temperatures exceeding 
about IS^'C in an expression system based on E. coli, or the gene is incorporated in an 
expression vector of the present invention and made expressed in a bacterium of genus 
Rhodococcus at medium to high temperatures exceeding about 15°C, and if the gene does not 
express or expresses in an amoimt significantly lower than the amount of the case that a 
foreign gene is incorporated in an expression vector of the present invention and made 
expressed in a bacterium of genus Rhodococcus at low temperature, the protein can be 
recognized as a protein which is difficult or impossible to be expressed at medium to high 
temperatures exceeding about IS^'C. 

For example, when a gene is made expressed using E, coli commonly used for ordinary 
expression production of a recombinant protein and it cannot be expressed at 18 to 37**C which 
is the suitable growth temperature range of E. coli, or the protein is lethal to the E. coli, or 
inhibits proliferation of E. coli, or forms aggregation of inactive inclusion body, the gene of 
the above-mentioned protein can be introduced into Rhodococcus erythropolis and 
Rhodococcus erythropolis can be proliferated at a low temperature of 4 to 18°C to enable an 
efficient production of the above-mentioned protein in large quantities. In addition, when a 
gene is made expressed using Rhodococcus erythropolis at a temperature exceeding about 
IS'^C and it cannot be expressed, or the protein is lethal to the Rhodococcus erythropolis, or 
inhibits proliferation of Rhodococcus erythropolis, it is also possible to perform an efficient 
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production of the above-mentioned protein in large quantities by proliferating Rhodococcus 
erythropolis at a low temperature of 4 to 18®C. 

Although the type of the protein which inhibits proliferation of the host cell at medium 
to high temperatures exceeding about 15^C is not limited, examples thereof include proteins 
illustrated in the below-mentioned Examples. The genes encoding such a protein can be 
incorporate into a site having a multicloning site incorporated downstream the 
below-mentioned promoter. 

The vector which can inducibly express a foreign gene means a vector in which 
expression of a foreign gene incorporated by a certain process is induced. For example, it is 
possible to construct an inducible expression vector by incorporating a promoter whose 
expression can be induced with a specific regulating substance into a vector. Such a 
promoter includes a promoter which can be specifically induced by introducing a chemical 
agent, i.e., an inducing substance into the culture medium of the host cell, for example, TipA 
gene promoter which is a thiostrepton inducing promoter. The target protein can be 
expressed in large quantities by adding a chemical agent which induces expression of the 
protein after sufficiently proliferating the host cell in which has been introduced a vector 
incorporating such an inductive promoter at the temperature suitable for proliferation of the 
cell at a temperature of 15 to 18°C or higher. In addition, TipA gene encoding TipA protein 
and any suitable promoters such as ThcA gene promoter which induces the expression of TipA 
gene may be incorporated. When the host cells are bacteria belonging to genus Rhodococcus, 
since these bacteria are susceptible to thiostrepton, thiostrepton resistance gene or the like 
which confers resistance to thiostrepton is incorporated. 

Moreover, the expression vector of the present invention may contain a dmg resistance 

gene. 

Furthermore, the vector may be a composite vector (shuttle vector) so that it may be 
compatible to two or more host cells. Examples of such a vector include a vector which can 
be introduced into any of bacteria belonging to E, coli and genus Rhodococcus, and can 
express a foreign gene in these host cells. When building such a vector, it is necessary to 
incorporate DNA regions essential to autonomous replication of a plasmid in each host cell. 
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For example, as for a composite vector suitable for bacteria belonging to E, coli and genus 
Rhodococcus, ColE\ sequence may be incorporated as a DNA region essential to autonomous 
replication of a plasmid for E, coli and RepA and RepB genes may be incorporated as DNA 
regions essential to autonomous replication of a plasmid for bacteria belonging to genus 
Rhodococcus. Such a composite vector can be reproduced in large quantities using E. coli. 

The expression vector of the present invention contains at least a DNA sequence having 
the first promoter activity and the first multicloning site for incorporating a foreign gene. 
Furthermore, it may contain DNA region(s) essential to autonomous replication of the first 
plasmid, the first drug resistance gene, a foreign gene linked to the first multicloning site, and 
the first transcription termination sequence. TipA gene promoter can be mentioned as a DNA 
sequence having the first promoter activity and when TipA gene promoter is contained, TipA 
gene and the second promoter sequence such as ThcA gene promoter for expressing TipA gene, 
and the second transcription termination sequence downstream fi-om the TipA gene are 
contained. The TipA gene promoter can be a sequence modified therefirom such as 
TipA-LGlO promoter. Furthermore, when TipA gene promoter inducible expression system 
is contained and the host cell is a bacterium of genus Rhodococcus, thiostrepton resistance 
gene should be contained in order to confer thiostrepton resistance to the bacterium of genus 
Rhodococcus. 

The DNA sequence having a promoter activity, the foreign gene and the transcription 
termination sequence constitute an expression cassette, and TipA gene and the promoter for 
TipA gene expression constitute an inducer cassette. 

The expression vector for a bacterium of genus Rhodococcus of the present invention 
may express the protein not only at low temperature but also at medium to high temperatures 
exceeding 15**C as long as the protein itself can be expressed at medium to high temperatures 
exceeding 15°C. 

pTip vector shown in Fig. 9 can be mentioned as an expression vector of the present 
invention and the vector includes pTip-NHl, pTip-NH2, pTip-CHl, pTip-CH2, pTip-LNHl, 
pTip-LNH2, pTip-LCHl and pTip-LCH2 depending on the structure of the multicloning site 
as shown in Fig. 9a. The sequences of pTip-NHl, pTip-NH2, pTip-CHI, pTip-CH2, 
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pTip-LNHl, pTip-LNH2, pTip-LCHl and pTip-LCH2 vectors are indicated as the SEQ ID 
Nos. 106 to 1 13, respectively. 

In addition, expression vectors of the present invention include pTip-CHl.l, 
pTip.CH2.1, pTip-LCHl.l and pTip-LCH2.1 in which Bgni and Xhol sites are separated in 
pTip-CHl, pTip-CH2, pTip-LCHl, and pTip-LCH2 so that the reading frame of the 
muhicloning site after the Xhol site is in frame with the reading frame of a commercial pET 
vector (Novagen). 

The vector of the present invention can be easily constructed by following the 
description of the below-mentioned Examples and vector construction diagrams of Figs. 1 to 
8. 

2. Use of vector of the present invention 

Use of vector of the present invention enables to produce a protein which is difficult or 
impossible to be expressed at medium to high temperatures exceeding about IS^'C. Examples 
of the proteins include a protein which cannot be expressed at a temperature within the 
suitable growth temperature range of the host cell but can be expressed at a temperature lower 
than the suitable growth temperature of the cell when the identical or different kind of host cell 
is used as the host cell; a protein which is lethal to the host cell when it is expressed at a 
temperature within the suitable growth temperature range of the host cell but not lethal to the 
identical or different kind of host cell at a temperature lower than the suitable growth 
temperature of the host cell; a protein which inhibits proliferation of the host cell when it is 
expressed at a temperature within the suitable growth temperature range of the host cell but 
does not inhibit proliferation of the host cell at a temperature lower than the optimal growth 
temperature of the identical or different kind of host cell; a protein which forms an inactive 
protein aggregation called inclusion body when it is expressed at a temperature within the 
suitable growth temperature range of the host cell but becomes an active and soluble protein 
when expressed in the host cell at a temperature lower than the optimal growth temperature of 
the identical or different kind of host cell; a protein derived from psychrotrophic bacteria 
whose suitable growth temperature range is 20^*0 or lower, heterothermic animals living under 
low-temperature environment and plants living under low-temperature environment. 
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The above-mentioned proteins can be expressed by incorporating a gene encoding such 
a protein into a muhicloning site of the expression vector of the present invention using a 
suitable restriction enzyme, transforming the host cell by this vector, and culturing the host 
cell under low-temperature conditions. The host cell should be a cell which can proliferate at 
low temperature, and a bacterium belong to genus Rhodococcus, preferably R.erythropolis^ 
Rfascians, R.opacus, etc. can be mentioned. Although these cells can proliferate at low 
temperature, suitable temperature for proliferation is 15**C or higher, more preferably IS^C or 
higher and particularly preferably around 30°C, and the protein can be expressed by 
sufficiently proliferating the cells at a temperature suitable for proliferation before expressing 
the protein incorporated in the gene and subsequently placing them under low-temperature 
condition to express the protein utilizing the inducible promoter function contained in the 
vector along with a suitable chemical agent. 

When the vector of the present invention contains TipA gene promoter, expression of 
protein is induced by adding thiostrepton to the culture medium. Under the present 
circumstances, thiostrepton may be added so that the final concentration may be at or more 
than 0.1 M-g/ml, preferably at or more than 1 ixg/ml. However, growth will be deteriorated 
when the final concentration exceeds 10 ^ig/ml. 

Use of vector of the present invention enables to screen a protein which is difficult or 
impossible to be expressed at medium to high temperatures exceeding about 1 5°C. Examples 
of the proteins include a protein which cannot be expressed at a temperature within the 
suitable growth temperature range of the host cell but can be expressed at a temperature lower 
than the suitable growth temperature of the cell when the identical or different kind of host cell 
is used as the host cell; a protein which is lethal to the host cell when it is expressed at a 
temperature within the suitable growth temperature range of the host cell but not lethal to the 
identical or different kind of host cell at a temperature lower than the suitable growth 
temperature of the host cell; a protein which inhibits proliferation of the host cell when it is 
expressed at a temperature within the suitable growth temperature range of the host cell but 
does not inhibit proliferation of the host cell at a temperature lower than the optimal growth 
temperature of the identical or different kind of host cell; and a protein which forms an 
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inactive protein aggregation called inclusion body when it is expressed at a temperature within 
the suitable growth temperature range of the host cell but becomes an active and soluble 
protein when expressed in the host cell at a temperature lower than the optimal growth 
temperature of the identical or different kind of host cell. 

For example, poly(A)^RNA is extracted from a suitable tissue of a suitable animal 
species, cDNA is synthesized, and incorporated in an expression vector. Subsequently, host 
cells such as E, colt are transformed using this vector, an expression library is constructed, 
proliferated and expressed at 30°C, and the gene is isolated from the clone in which the gene is 
incorporated and whose proliferation has been inhibited whereby genes for encoding a protein 
which is lethal to the host cell when it is expressed at a temperature within the suitable growth 
temperature range of the host cell but not lethal to the identical or different kind of host cell at 
a temperature lower than the suitable growth temperature of the host cell and a protein which 
inhibits proliferation of the host cell when it is expressed at a temperature within the suitable 
growth temperature range of the host cell but does not inhibit proliferation of the host cell at a 
.temperature lower than the optimal growth temperatvire of the identical or different kind of 
host cell are selected. For this purpose, a promoter inducible by a suitable chemical agent is 
incorporated in the expression vector and clones may be selected by which proliferation of the 
host cells is inhibited when the expression is induced by the chemical agent while they can 
proliferate when the expression is not induced. Subsequently, isolated genes are incorporated 
in the expression vector of the present invention and Rhodococcus erythropolis is transformed 
by this recombinant expression vector, proliferated and expressed at a low temperature from 4 
to 15°C, and clones are selected which express the gene without being inhibited from 
proliferation whereby the gene encoding the above-mentioned protein can be screened. 
Altematively, genes of the cDNA library are incorporated in the expression vector of the 
present invention and Rhodococcus erythropolis is transformed by this recombinant 
expression vector, proliferated and expressed at a low temperature or medium to high 
temperatures exceeding about IS'^C, and clones are selected which express the incorporated 
gene without being inhibited from proliferation or clones are selected in which the gene being 
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expressed upon expression induction is incorporated whereby the gene encoding the 
above-mentioned protein can be screened. 

The present invention encompasses the proteins obtained by the above-mentioned 
screening which are difficult or impossible to be expressed at medium to high temperatures 
exceeding about 15°C. These proteins can be exemplified by the proteins illustrated in the 
following Example. 

This specification incorporates the contents disclosed in the specification and/or 
drawings of Japanese Patent AppHcation No. 2002-235008, from which the priority of the 
present application is claimed. 

Brief Description of the Drawings 

Fig. 1 is a diagram illustrating the construction of plasmid pHN136 which constitutes 
the backbone of the inducible expression vector. The positions of the restriction enzyme 
recognition site and the structural gene are shown in the drawing. The numbers indicate the 
number of base pairs (kilo base pair: kb); 

Fig. 2 is a diagram illustrating the construction of plasmid pHN143 having a 
thiostrepton resistance gene. The positions of the restriction enzyme recognition site and the 
structural gene are shown in the drawing. The numbers indicate the number of base pairs 
(kilo base pair: kb). CIAP means Calf Intestine Alkaline Phosphatase and Blu. means a blunt 
end; 

Fig. 3 is a diagram illustrating the construction of plasmid pHN62 having an inducer 
cassette. The positions of the restriction enzyme recognition site and the structural gene are 
shown in the drawing. The numbers indicate the number of base pairs (kilo base pair: kb). 
Blu. means a blunt end; 

Fig. 4 is a diagram illustrating the construction of plasmid pHN153 having an 
expression cassette. The positions of the restriction enzyme recognition site and the 
structural gene are shown in the drawing. The numbers indicate the number of base pairs 
(kilo base pair: kb). CIAP means Calf Intestme Alkaline Phosphatase and Blu. means a blunt 
end; 
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Fig. 5 is a diagram illustrating the construction of plasmid pHN169 having a 
tetracycline resistance gene. The positions of the restriction enzyme recognition site and the 
structural gene are shown in the drawing. The numbers indicate the number of base pairs 
(kilo base pair: kb), CIAP means Calf Intestine Alkaline Phosphatase and Blu. means a blunt 
end; 

Fig. 6 is a diagram illustrating the construction of inducible expression vector plasmids 
pHN170 and pHN171 which have PIP as a reporter gene. The positions of the restriction 
enzyme recognition site and the structural gene are shown in the drawing. The numbers 
indicate the number of base pairs (kilo base pair: kb), CIAP means Calf Intestine Alkaline 
Phosphatase. A series of steps is shown by dividing into two parts in Fig. 6, but there is an 
overlap in the two portions so that the order of the steps may become clear; 

Fig. 7 is a diagram illustrating the construction of inducible expression vector plasmids 
pTip-NHl, pTip-CHl, pTip-LNHl and pTip-LCHl having a multicloning site. The 
positions of the restriction enzyme recognition site and the structural gene are shown in the 
drawmg. The numbers indicate the number of base pairs (kilo base pair: kb). A series of 
steps is shown by dividing into two parts in Fig. 7, but there is an overlap in the two portions 
so that the order of the steps may become clear; 

Fig, 8 is a diagram illustrating the construction of inducible expression vector plasmids 
pTip-NH2, pTip-CH2, pTip-LNH2 and pTip-LCH2 having a multicloning site. The 
positions of the restriction enzyme recognition site and the structural gene are shown in the 
drawing. The numbers indicate the number of base pairs (kilo base pair: kb). A series of 
steps is shown by dividing into two parts in Fig. 8, but there is an overlap in the two portions 
so that the order of the steps may become clear; 

Fig. 9a is a) a diagram showing the map of pTip-NHl, pTip-CHI, pTip-LNHl, 
pTip-LNHl, pTip-NH2, pTip-CH2, pTip-LNH2 and pTip-LCH2. The function of each 
domain and the map of the plasmids are shown; 

Fig. 9b shows b) the DNA sequence from TipA gene promoter sequence of pTip-NHl, 
pTip-LNHl, or from TipA-LGlO promoter sequence to the multicloning site and the 
transcription termination sequence of ThcA gene; 
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Fig. 9c shows c) the DNA sequence from TipA gene promoter sequence of pTip-CHl, 
pTip-LCHl, or from TipA-LGlO promoter sequence to the multicloning site and the 
transcription termination sequence of The A gene; 

Fig. 9d shows d) the DNA sequence from TipA gene promoter sequence of pTip-NH2, 
pTip-LNH2, or from TipA-LGlO promoter sequence to the multicloning site and the 
transcription termination sequence of ThcA gene; 

Fig. 9e shows e) the DNA sequence from TipA gene promoter sequence of pTip-CH2, 
pTip-LCH2, or from TipA-LGlO promoter sequence to the multicloning site and the 
transcription termination sequence of ThcA gene; 

Fig. 10 is a diagram showing the map of pTip-CHl.l, pTip-LCHl.l, pTip-CH2.1 and 
pTip-LCH2.1; 

Fig. 11 is a diagram illustrating the construction of the control plasmids pHN172 and 
pHN173 for PIP activity measurement. The positions of the restriction enzyme recognition 
site and the structural gene are shown in the drawing. The numbers indicate the number of 
base pairs (kilo base pair: kb). CLAP means Calf Litestine Alkaline Phosphatase. pHN170 
has both of an "expression cassette" and an "inducer cassette" whereas pHN173 h£is only an 
"expression cassette" and pHN172 does not have neither of the cassettes; 

Fig. 12 is a figure showing the result of the measurement 1 of the PIP activity using the 
inducible expression vector; 

Fig. 13 is a graph showing the result of the measurement 2a of the PIP activity using 
the inducible expression vector; 

Fig. 14 is a graph showing the result of the measurement 2b of the PIP activity using 
the inducible expression vector; 

Fig. 15 is a figure showing the result of the measurement 3 of the PIP activity using the 
inducible expression vector; 

Fig. 16 is a photograph in which the result of purification 1 of the foreign protein using 
the inducible expression vector; 

Fig. 17 is a photograph in which the result of purification 2 of the foreign protein using 
the inducible expression vector; 
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Fig. 18 is a photograph in which the result of purification 3a of the foreign protein 
using the inducible expression vector; 

Fig. 19 is a photograph in which the result of purification 3b of the foreign protein 
using the inducible expression vector; 

Fig. 20 is a figure showing the list of proteins which inhibit proliferation of E, coli at 

30*^C; 

Fig. 21 is a figure showing expression of the foreign protein using Rhodococcus 
erythropolis and E. coli as a host; 

Fig. 22 is a figure showing TipA gene promoter sequence; £ind 

Fig. 23 is a figure showing an improvement of the RBS sequence (WTRBS) in TipA 
gene promoter to LGIORBS. 

Best Mode for Carrying Out the Invention 

The present invention will be described in more detail by way of the following 
examples. The technical scope of the present invention, however, is not limited to these 
examples. 
[Example 1] 

Isolation of the plasmid derived from Rhodococcus erythropolis which can autonomously 
replicate in a bacterium of genus Rhodococcus and determination of a partial DNA sequence 
thereof 

In order to create a composite vector for Rhodococcus erythropolis and E. coli, small 
endogenous plasmids occurring in a bacterium of genus Rhodococcus were first searched. 
As a result, there was one observed in Rhodococcus erythropolis JCM2895 strain. This 
plasmid was named pRE2895. Isolation of the plasmid and determination of a partial DNA 
sequence thereof are specifically described below. 

pRE2895 was purified using QIAprep Spin MiniprepKit (product of QIAGEN) from 
cell bodies obtained by culturing Rhodococcus erythropolis JCM2895 strain using 5 ml LB 
culture medium (1% Difco Bacto Tryptone, 0.5% Difco Yeast Extract, 1.0% sodium chloride) 
at 30^C for 30 hours. This procedure was conducted following the manufacturer's 
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instructions except that 5 \xl of lysozyme (100 mg/ml) was added followed by incubation at 
Sy^C for 30 minutes after the sample was dispersed in 250 of buffer PI and before 250 |ii of 
buffer P2 was added. 

The above-mentioned DNA sample was processed with restriction enzyme ^coRI and 
subjected to 1.0% agarose gel electrophoresis (100 V, 30 minutes), and existence of one DNA 
fragment of about 5.4 kb was observed. 

This DNA fragment of about 5.4 kb was excised from the gel, and purified with 
QIAquick Gel Extraction Kit (product of QIAGEN) following the manufacturer's instructions. 
The obtained EcoRl fragment was subcloned into the EcoRL site of pleismid pBluescript II 
SK(+) (product of STRATAGENE) according to an ordinary method (Sambrook et al., 
Molecular Cloning: a laboratory manual, 2nd edition (1989), Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N. Y.), and this plasmid was named pHN79. 

The nucleotide sequence of pHN79 was sequenced by using DNA sequencer ABI 
PRISM (R) 3100 Genetic Analyzer (product of ABI Co. Ltd.) with both of Reverse and 
Ml 3-20 primers (products of STRATAGENE) following the manufacturer's instructions by 
about every 400 bases, respectively. Homology search revealed that the DNA region derived 
from Rhodococcus erythropolis JCM 2895 strain which was subcloned into pHN79 agreed 
with pN30, a circular DNA of 5403 base pairs which was registered in GenBank xmder 
Accession Number AF 3 12210 by 99.8% of the sequence. 

Although the full length sequence of the isolated pRE2895 was not determine, it was 
highly homologous with pN30, and since the restriction enzyme cleavage map thereof agreed 
with the map expected from pN30, it was expected that they are homologous throughout the 
whole plasmid. In the meantime, pN30 was highly homologous with endogenous plasmid 
pAL5000 isolated from Mycobacterium fortuitum 002 strain (Rauzer et al., Gene 71 315-321 
(1988); Stoh and Stoker, Microbiology 142 2795-2802 (1996)) and pFAJ2600 isolated from 
Rhodococcus erythropolis NI 86/21 strain (De Mot et al.. Microbiology 143 3137-3147 
(1997)) and it was supposed that they are autonomously replicated by a similar mechanism. 
Since a region containing presumed RepA gene, presumed RepB gene and presumed 
replication origin was suflFicient for the autonomous replication of pAL5000 in each bacterium, 
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it was considered sufficient for the autonomous replication of pRE2895 in a bacterium of 
genus Rhodococcus to merely incorporate a similar region into an expression vector. 

[Example 2] 

Construction of vector plasmid pHN136 

The following procedure was conducted in order to create a composite vector for both 
the bacteria using a part of pRE2895 isolated in Example 1 and a part of plasmid which can 
autonomously replicate in E, coli (Fig. 1). 

Amplification of DNA was conducted using plasmid pBluescript II SK (-) (product of 
STRATAGENE) as a template along with synthetic oligodeoxyribonucleotide primers 
(hereinafter abbreviated as primers) of SEQ ID Nos. 1 and 2 in the Sequence Listing according 
to polymerase chain reaction method (herein£ifter abbreviated as PGR: Saiki et £d.. Science, 
239 487-491 (1988)). The enzyme used for PGR is Pfu turbo (product of STRATAGENE). 
Gonsequently, amplified DNA of 2.0 kb containing an ampicillin resistance gene (designated 
as Amp^ in the drawings) and a Co/El sequence region required for autonomous replication in 
E, coli was obtained. Double digestion of this DNA fi'agment was carried out with restriction 
enzymes Sad and BsrGl, and the products were subjected to 1,0% agarose gel electrophoresis 
(100 V, 30 minutes). The DNA fi'agment was excised, and purified with QLAquick Gel 
Extraction Kit according to the manufacturer's instructions. 

A primer which amplifies the region considered to be required for autonomous 
replication in a bacterium of genus Rhodococcus was designed based on the sequence of pN30 
(Example 1). The sequences of these primers are shown by SEQ ID Nos. 3 and 4 in the 
Sequence Listing. PGR was performed using plasmid pHN79 as a template along with both 
the primers, and amplified DNA having 1.9 kb was obtained. Double digestion of this DNA 
fi-agment was carried out with restriction enzymes ^^-rGI and Sacl^ and the products were 
subjected to 1.0% agarose gel electrophoresis (100 V, 30 minutes). The DNA fragment was 
excised, and purified in the same manner as in the above-mentioned method. 

The two above-mentioned purified DNA firagments were ligated using DNA Ligation 
Kit Ver.2 (product of TAKARA SHUZO GO., LTD.) following the manufacturer's 
instructions, and the obtained plasmid was named pHN129. 
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Next, the following procedure was performed in order to remove the restriction enzyme 
recognition sites BamHI and Sail which exist in pHN129. First, amplification by PCR was 
performed using pHN129 as a template along with primers of SEQ ID Nos. 5 and 6 in the 
Sequence Listing. A DNA fragment of 0.5 kb obtained by carrying out double digestion of 
this PCR fragment by BgUl and Pstl was subcloned into BamHL and Pstl sites of pHN129. 
As a result, BamHI recognition site was removed without substituting any encoded amino acid 
although the connecting site of BgUl and BamHI was within the open reading frame 
(hereinafter abbreviated as ORF) of a presumed RepA gene. In addition, Sail recognition site 
located very close to the BamHI recognition site was also removed simultaneously with the 
BamHI recognition site in the primer of SEQ ID No. 5 since the primer was designed so that 
Sail recognition site was removed and any encoded amino acid would not be substituted. 
This plasmid was named pHN135. 

Next, the following procedure was performed in order to remove the restriction enzyme 
recognition site BgUl which exists in pHN135. First, amplification by PCR was performed 
using pHN135 as a template along with primers of SEQ ID Nos. 5 and 6 in the Sequence 
Listing. A DNA fragment of 0.5 kb obtained by carrying out double digestion of this PCR 
fragment by Pstl and BamHI was subcloned into Pstl and BgUl sites of pHN135. As a result, 
BgUl recognition site was removed without substituting any encoded amino acid although the 
connecting site of BamHI and Bglll was within the ORF of a presumed RepB gene. This 
plasmid was named pHN136. 
[Example 3] 

Construction of vector plasmid pHN143 

Since Rhodococcus erythropolis is susceptible to thiostrepton, an antibiotic used for 
induction expression of protein, it should be imparted with resistance to this substance. Thus, 
the present inventors decided to incorporate tsr gene, a thiostrepton resistance gene which 
Streptomyces azureus has (Bibb et al., Mol. Gen. Genet. 199 26-36 (1985) - designated as 
Thio' in the drawing) into the composite vector. It was already reported that this gene 
functions within Rhodococcus erythropolis and confers thiostrepton resistance (Shao and 
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Behki, Lett. Appl. Microbiol. 21 261-266 (1995)). Below, isolation of this gene is described 
specifically (Fig. 2). 

First, the genome DNA of Streptomyces azureus JCM4217 strain used as the template 
of PGR was prepared as follows. This strain cultured at SO'^C by using 5 ml of SB culture 
medium (1% Difco Bacto Tryptone, 0.5% Difco Yeast Extract, 0.5% sodium chloride, 0.1% 
Glucose, 5 mM magnesium chloride, 0.5% glycine) was suspended into 500 |li1 of SET buffer 
(75 mM sodium chloride, 25 mM EDTA (pH 8.0), 20 mM Tris-HCl (pH 7.5)). 5 nl of a 
lysozyme solution (100 mg/ml) was added thereto and incubated at 37°C for 30 minutes. 
Then 14 |il of a protease K solution (20 mg/ml) and 60 jil of a sodiimi dodecylsulfate solution 
(10%) were added, and after mixing well, incubated at 55°C for 2 hours. Then, 200 j^l of a 
sodium chloride solution (5 M) and 500^il of chloroform were added, and rotation agitation 
was carried out at room temperature for 20 minutes. Centrifugal separation was carried out 
and 700 |iil of supernatant was removed. This was dried after isopropanol precipitation, and 
dissolved in 50|al of a TE solution (10 mM Tris-HCl (pH 8.0) and 1 mM EDTA (pH 8.0)). 

Amplification by PGR was performed using as a template the genome DNA of 
Streptomyces azureus JCM4217 strain purified as mentioned above along with primers of SEQ 
ID Nos. 7 and 8 in the Sequence Listing. Consequently, an amplified DNA of 1.1 kb 
containing a thiostrepton resistance gene was obtained. This DNA fi-agment was obtained by 
using platinum Pfx DNA polymerase (product of Gibco BRL), and therefore the end thereof is 
a blunt end. This DNA fi-agment was purified and after phosphorylated at 5' end by 
T4-polynucleotide kinase according to an ordinary method (Sambrook et al.. Molecular 
Cloning: a laboratory manual, 2nd edition (1989), Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y.) subcloned into Hindi site of plasmid pGEM-3Zf(H-)(product of 
Promega) (the direction of subcloning was a Hindlll recognition site-tsr gene ORF-EcoRI 
recognition site fi-om 5' end of DNA). This plasmid was named pHNl 37. 

Next, the following procedure was performed in order to remove the restriction enzyme 
recognition site Sail which exists in pHN137. First, amplification by PCR was performed 
using plasmid pHN137 as a template along with primers of SEQ ID Nos. 9 and 10 in the 
Sequence Listing. Platinimi Pfx DNA polymerase was used for this PCR. A DNA 
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fragment of 0.6 kb obtedned by digesting one end of this PGR fragment by HincUlI was 
purified and phosphorylated at 5' end of the side of a blunt end by T4-polynucleotide kinase 
following an ordinary method. In the meantime, amplification by PGR was performed using 
plasmid pHN137 as a template along with primers of SEQ ID Nos. 1 1 and 12 in the Sequence 
Listing. Platinum Pfic DNA polymerase was used for this PGR. A DNA firagment of 0.5 kb 
obtained by digesting one end of this PGR firagment by EcoKl was purified and fiuther 
phosphorylated at 5' end of the blunt end side by T4-polynucleotide kinase following an 
ordinary method. As a result of subcloning these two PGR fi-agments into Hmdlll and EcoRI 
sites of plasmid pGEM-3Zf(+) simultaneously. Sail recognition site was removed without 
substituting any encoded amino acid although the connecting site of the blimt ends was within 
the ORF of tsr gene. This plasmid was named pHN143. 
[Example 4] 

Gonstruction of vector plasmid pHN62 

In order to carry out induced type expression by thiostrepton, TipA protein there must 
be present in a bacterivim of genus Rhodococcus. Therefore, a constitutive promoter was 
separated firom Rhodococcus erythropolis and the structural gene encoding the TipA protein 
was linked to the downstream thereof (Fig. 3). As a promoter which fimctions constitutively, 
a promoter sequence of ThcA gene encoding the aldehyde dehydrogenase-like protein of 
Rhodococcus erythropolis (Nagy et al., J. Bacteriol. 177 676-687 (1995)) was used. 

The genome DNA of Streptomyces coelicolor A3 (2) strain used as a template was 
purified in the similar manner as in the preparation procedure of genome DNA firom 
Streptomyces azureus. The genome DNA of Rhodococcus erythropolis JGM3201 strain used 
as a template was purified in the similar manner as in the preparation procedure of genome 
DNA fi-om Streptomyces azureus except culturing was perform by using 5 ml LB culture 
medium. 

Amplification by PGR was performed using as a template the genome DNA of 
Streptomyces coelicolor A3 (2) strain purified as mentioned above along with primers of SEQ 
ID Nos. 13 and 14 in the Sequence Listing. Platinum Pfx DNA polymerase was used for this 
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PGR. Consequently, DNA (designated as TipA in the drawing) containing ORP of TipA 
gene and the transcription termination sequence downstream thereof was obtained. 

A DNA fragment of 0.9 kb obtained by digesting one end of this PGR fragment by 
Bglil was purified and phosphorylated at 5' end by the side of a blimt end by 
T4-polynucleotide kinase following an ordinary method. In the meantime, amplification by 
PGR was performed using as a template the genome DNA of Rhodococcus erythropolis 
JGM3201 strain along with primers of SEQ ID Nos. 15 and 16 in the Sequence Listing. 
Consequently, DNA containing a promoter sequence (designated as ALDHp in the drawing) 
of ThcA gene (Nagy et al., J. Bacteriol. 177 676-687 (1995)) encoding an aldehyde 
dehydrogenase-like protein was obtained. Platinum Pfx DNA polymerase was used for this 
PGR. A DNA fragment of 0.2 kb obtained by digesting one end of this PGR fragment by 
Xbal was purified and fVirther phosphorylated at 5' end of the blunt end side by 
T4-polynucleotide kinase following an ordinary method. As a result of subcloning these two 
PGR fragments into Xbal and BamlU sites of plasmid pGEM-3Zf(+) simultaneously, a 
plasmid containing ORF of TipA gene and the transcription termination sequence closely 
downstream from the promoter sequence of ThcA gene was obtained. This plasmid was 
named pHN33. 

Next, the following procedure was performed in order to remove two restriction 
enzyme Ncol recognition sites (hereinafter designated as Ncol (1) and Ncol (2)) which exist in 
pHN33. 

First, amplification by PGR was performed using plasmid pHN33 as a template along 
with primers of SEQ ID Nos. 9 and 17 in the Sequence Listing. Platinum Pfx DNA 
polymerase was used for this PGR. A DNA fragment of 0.5 kb obtained by digesting one 
end of this PGR fragment by Xbal was purified and phosphorylated at 5' end of the side of a 
blunt end by T4-polynucleotide kinase following an ordinary method. In the meantime, 
amplification by PGR was performed using plasmid pHN33 as a template along with primers 
of SEQ ID Nos. 18 and 12 in the Sequence Listing. Platinum Pfx DNA polymerase was used 
for this PGR. A DNA fragment of 0.6 kb obtained by digesting one end of this PGR 
fragment by Kpnl was purified and phosphorylated at 5* end of the side of a blunt end by 
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T4-polynucleotide kinase following an ordinary method. As a result of subcloning these two 
PGR fragments into Xbal and Kpnl sites of plasmid pGEM-3Zf(+) simultaneously, the Ncol{\) 
recognition site was removed without substituting any encoded amino acid although the 
connecting site of the blunt ends was within the ORF of TipA gene. This plasmid was named 
pHN50. 

Next, the following procedure was performed in order to remove the restriction enzyme 
recognition site Ncol (2) which exists in pHN33. First, amplification by PGR was performed 
using plasmid pHN33 as a template along with primers of SEQ ID Nos. 9 and 19 in the 
Sequence Listing. Platinum Pfx: DNA polymerase was used for this PGR. A DNA 
fragment of 0.8 kb obtained by digesting one end of this PGR fragment by Xbal was purified 
and phosphorylated at 5' end of the side of a blunt end by T4-polynucleotide kinase following 
an ordinary method. In the meantime, amplification by PGR was performed using pHN33 as 
a template along with primers of SEQ ID Nos. 20 and 12 in the Sequence Listing. Platinum 
Pfx DNA polymerase was used for this PGR. A DNA fragment of 0.3 kb obtained by 
digesting one end of this PGR fragment by Kpnl was purified and phosphorylated at 5' end of 
the side of a blunt end by T4-polynucleotide kinase following an ordinary method. As a 
result of subcloning these two PGR fragments into Xbal and Kpnl sites of plasmid 
pGEM-3Zf(+) simultaneously, the Ncol(2) recognition site was removed without substituting 
any encoded amino acid although the connecting site of the blunt ends was within the ORF of 
TipA gene. This plasmid was named pHN5 1 . 

The following procedure was performed at the end. A DNA fragment of 0.7 kb 
obtained by double digestion of pHN50 by Xbal and Sad and a DNA fragment of 0.4 kb 
obtained by double digestion of pHN51 by Sad and Kpnl were subcloned into Xbal and Kpnl 
sites of plasmid pGEM-3Zf(+) simultaneously. As a result, a plasmid having a TipA gene 
lacking in both the iVcoI(l) and A^coI(2) recognition sites was acquired, and this was named 
pHN62. 
[Example 5] 

Gonstruction of vector plasmid pHN153 
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In order to confirm whether the target protein can be inducibly expressed, ORF of a 
gene encoding proline iminopeptidase derived from Thermoplasma acidophilum (Tamura et 
al., FEBS Lett. 398 101-105 (1996); hereinafter abbreviated as PIP) was linked downstream 
from the promoter of a TipA gene as a reporter gene ORF and a transcription termination 
sequence was linked further downstream therefrom to suppress the read-through of 
transcription. Details are specifically described below (Fig. 4). 

Amplification by PGR was performed using as a template the genome DNA of the 
Streptomyces coelicolor A3 (2) strain purified in Example 4 along with primers of SEQ ID 
Nos. 21 and 22 in the Sequence Listing. Consequently, an amplified DNA of 0.2 kb 
containing a promoter sequence of TipA gene (designated as TipAp in the drawmg) was 
obtained. Platinum Pfic DNA polymerase was used for this PGR. This fragment was 
purified and phosphorylated at 5* end by T4-polynucleotide kinase following an ordinary 
method and subcloned into Smal site of plasmid pBluescript II SK (+) (the direction of 
subcloning was a Kpnl recognition site-rip^ gene promoter sequence-iSacI recognition site 
from 5' end of DNA). This plasmid was named pHN 1 50u. 

Next, amplification by PGR was performed using plasmid pRSET-PIP (Tamura et al., 
FEBS Lett. 398 101-105 (1996) — heremafter abbreviated as PIP) as a template along with 
primers of SEQ ID Nos. 23 and 24 in the Sequence Listing. The primer of SEQ ID No. 24 in 
the Sequence Listing is designed so that the termination codon of PIP gene is removed and a 
6xHis tag is attached to the G-terminal of the PIP protein for facilitating purification of the 
protein. The 6xHis tag consists of six contiguous histidine residues and a protein fused with 
this exhibits high affinity to nickel ion, etc. Therefore, purification becomes easy by the 
metal chelate chromatography using nickel ion etc. (Growe et al., Methods Mol. Biol. 31 
371-387 (1994)). As a resuh of double digestion of a DNA fragment of 0.9 kb containing 
this PIP gene by restriction enzymes Ncol and Spel and subcloning into Ncol and Spel sites of 
pHNlSOu, a plasmid containing ORF of PIP gene immediately downstream from the promoter 
sequence of TipA gene was created and named pHNl 5 lu. 

Next, amplification by PGR was performed using as a template the genome DNA of the 
Rhodococcus erythropolis JGM3201 strain purified in Example 4 along with primers of SEQ 
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ID Nos. 25 and 26 in the Sequence Listing. Consequently, DNA containing the transcription 
termination sequence of ThcA gene (Nagy et al., J. Bacteriol. 177 676-687 (1995) — 
designated as ALDHt in the drawing) was obtained. Double digestion of this DNA fragment 
of 0.2 kb was carried out by restriction enzymes Spel andXbal, and the product was subcloned 
into Spel and Xbal sites of pHN151u. Consequently, a plasmid containing ORF of PIP gene 
immediately downstream from the promoter sequence of TipA gene and further containing a 
transcription termination sequence of ThcA gene downstream therefrom was created, and 
named pHN153. 
[Example 6] 

Construction of vector plasmid pHN169 

In order to carry out the transformation of Rhodococcus etythropolis by plasmid, a 
suitable transformation marker is needed. Therefore, the present inventors decided to 
connect and use a drug resistance gene downstream from a powerful promoter which functions 
within a bacterium of genus Rhodococcus. As a promoter, the present inventors selected to 
use Tufl gene promoter which encodes the elongation factor Tu derived from a bacterium of 
genus Streptomyces. This is because there has been a report that this promoter powerfully 
transcribes a downstream gene (Wezel et al., Biochim. Biophys. Acta 1219 543-547 (1994)). 
As a drug resistance gene, a tetracycline resistance gene which is easily available was used. 
Details are specifically described below (Fig. 5). 

AmpUfication by PCR was performed using as a template the genome DNA of the 
Streptomyces coelicolor A3 (2) strain purified in Example 4 along with primers of SEQ ID 
Nos. 27 and 28 in the Sequence Listing, Consequently, an amplified DNA of 0.2 kb 
containing a promoter sequence of Tufl gene (designated as Tuflp in the drawing) was 
obtained. Platinum Pfx DNA polymerase was used for this PCR. This fragment was 
purified and phosphorylated at 5' end by T4-polynucleotide kinase following an ordinary 
method and subcloned into Hindi site of plasmid pBluescript II SK (+) (the direction of 
subcloning was a Kpnl recognition sit^-Tufl gene promoter sequence- EcdRL recognition site 
from 5* end of the DNA). This plasmid was named pHNl 58. 
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Next, amplification by PGR was performed using plasmid pACYC184 (Rose, Nucleic 
Acids Res. 16 355 (1988)) as a template along with primers of SEQ ID Nos. 29 and 30 in the 
Sequence Listing. Consequently, DNA containing a tetracycline resistance gene (designated 
as Tet*^ in the drawing) was obtained. Double digestion of this DNA fragment of 1.3 kb was 
carried out by restriction enzymes Xhol and Spel, and the product was subcloned into Sail and 
Spel sites of pHN158. Consequently, a plasmid containing a tetracycline resistance gene was 
created immediately downstream of the promoter sequence of Tufl gene, and named pHN159. 

Next, the following procedure was performed in order to remove the restriction enzyme 
recognition site BamUl which exists in pHN159. First, amplification by PCR was performed 
using plasmid pHN159 as a template along with primers of SEQ ID Nos. 31 and 32 in the 
Sequence Listing. Since the DNA fi-agment is Pfii turbo DNA polymerase, both ends were 
blunt ends. A DNA fi-agment of 0.5 kb obtained by digesting one end of this PCR fi-agment 
by Xhol was purified and phosphorylated at 5' end of the side of the blunt end by 
T4-polynucleotide kinase following an ordinary method. In the meantime, amplification by 
PCR was performed using pHN159 as a template along with primers of SEQ ID Nos. 33 and 
34 in the Sequence Listing. Pfii turbo DNA polymerase was used for this PCR. A DNA 
fragment of 1.1 kb obtained by digesting one end of this PCR fragment by Notl was purified 
and phosphorylated at 5' end of the side of a blunt end by T4-polynucleotide kinase following 
an ordinary method. As a result of subcloning these two PCR fragments into Xhol and Notl 
sites of plasmid pBluescript II SK (+) simultaneously, the BamHl site was removed without 
substituting any encoded amino acid although the connecting site of the blunt ends was within 
the ORF of a tetracycline resistance gene. This plasmid was named pHN165. 

Next, the following procedure was performed in order to remove the restriction enzyme 
recognition site Sail which exists in pHN159. First, amplification by PCR was performed 
using plasmid pHN159 as a template along with primers of SEQ ID Nos, 31 and 35 in the 
Sequence Listing. Pfii turbo DNA polymerase was used for this PCR. A DNA fragment of 
0.8 kb obtained by digesting one end of this PCR fragment by Xhol was purified and 
phosphorylated at 5' end of the side of a blunt end by T4-polynucleotide kinase following an 
ordinary method. In the meantime, amplification by PCR was performed using pHN159 as a 
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template along with primers of SEQ ID Nos. 36 and 34 in the Sequence Listing. Pfii turbo 
DNA polymerase was used for this PGR. A DNA fragment of 0.8 kb obtained by digesting 
one end of this PGR fragment by Notl was purified and phosphorylated at 5' end of the side of 
a blunt end by T4-polynucleotide kinase following an ordinary method. As a result of 
subcloning these two PGR fragments into Xhol and Notl sites of plasmid pBluescript II SK (+) 
simultaneously, the Sail recognition site was removed without substituting any encoded amino 
acid although the connecting site of the blunt ends was within the ORF of a tetracycline 
resistance gene. This plasmid was named pHNl 66. 

The following procedure was performed at the end. A DNA fragment of 0.9 kb 
obtained by double digestion of pHN166 by Sphl and Spel was subcloned into Sphl and Spel 
sites of plasmid pHN165, As a result, a clone having a tetracycline resistance gene lacking in 
both the BamY¥L and Sail recognition sites was acquired, and this plasmid was named pHN169. 
[Example 7] 

Gonstruction of vector plasmids pHN170 and pHNlTl 

The following procedure was performed to link the gene clusters isolated in Examples 
2 to 6 together and to construct an expression vector which can be induced in a bacteriimi of 
genus Rhodococcus (Fig. 6). 

A DNA fragment of 1.1 kb obtained by digesting pHN143 by Sad was subcloned into 
Sad site of pHN136 (the direction of subcloning was a presimied RepB gene ORP-tsr gene 
ORF-ampicillin resistance gene ORF from 5* end of the DNA). The resultant plasmid was 
named pHN144. 

Next, a DNA fragment of 1.1 kb obtained by double digestion of pHN62 by Xbal and 
Kpnl was subcloned into Xbal and Kpnl sites of plasmid pHN144. The resultant plasmid was 
named pHN152. 

Next, a DNA fragment of 1.2 kb obtained by double digestion of pHN153 by ^^rGI 
and Xbal was subcloned into ^^rOI and Spel sites of plasmid pHN153. The resultant 
plasmid was named pHN154. 

Next, a DNA fragment of 1.6 kb obtained by double digestion of pHN169 hy Xbal and 
Spel was subcloned into Xbal site of plasmid pHN154 (the direction of subcloning was tsr 
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gene ORF-tetracycline resistance gene ORP-77ic^ gene promoter sequence from 5* end of the 
DNA). As a result, a plasmid containing PIP gene under control of the TipA gene promoter 
was created, and the resultant plasmid was named pHN170. 

The ribosome binding site downstream of the TipA gene promoter was changed to a 
sequence which was derived from lambda phage genelO and considered to be good in 
translation efficiency (Gold and Stormo, Methods Enzymol. 185 89-93 (1990)) for enhancmg 
expression of recombinant protein (Fig. 6). Details are specifically described below. 

Amplification by PGR was performed using plasmid pHN170 as a template along with 
primers of SEQ ID Nos. 21 and 37 in the Sequence Listing. Consequently, a hybrid promoter 
consisting of the TipA gene promoter and the binding site derived from lambda phage gene 10 
(hereinafter designated as TipA-LGlO promoter: also designated TipA-LGlOp in the drawing) 
was obtained. Double digestion of this DNA fragment of 0.2 kb was carried out with 
restriction enzymes BsrGl and Ncol, and the product was subcloned into BsrGl and Ncol sites 
of pHN170. As a result, a plasmid containing PIP gene under control of the TipA-LGlO 
promoter was created, and the resultant plasmid was named pHN171. The TipA promoter 
sequence is shown in Fig. 22 and an improvement of the ribosome binding site (RBS) 
sequence for the modification of the TipA promoter to the TipA-LGlO promoter is shown in 
Fig. 23. 
[Example 8] 

Construction of vector plasmid pTip-NHl, pTip-CHl, pTip-LNHl, and pTip-LCHl 

The following procedure was performed in order to remove the PIP gene, which is a 
reporter, from the plasmid described in Example 7 and to mtroduce a multicloning site (Fig. 
7)- 

The synthetic oligodeoxyribonucleotides described in SEQ ID Nos. 38 and 39 in the 
Sequence Listing contain sequences to be multicloning sites and have complementary 
sequences to each other. These two were mixed in the same molar amount, processed at 
70°C for 10 minutes, and cooled to room temperature for converting them for 20 minutes into 
a double-strand. As a result, the end parts thereof became linkable to a vector obtained by 
double digestion by Ncol and Spel and this double-stranded synthetic DNA (designated as 
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MCS Linker NNco in the drawing) was subcloned into Ncol and Spel sites of pHN170. The 
resultant plasmid was named pTip-NHl. Furthermore, the synthetic 

oligodeoxyribonucleotides described in SEQ ID Nos. 40 and 41 in the Sequence Listing, 
which contain sequences to be multicloning sites and have complementary sequences to each 
other, were converted into a double-stranded synthetic DNA in the same way (designated as 
MCS Linker CNco in the drawing) was subcloned into Ncol and Spel sites of pHN170. The 
resultant plasmid was named pTip-CHl . 

The hybrid DNA consisting of the TipA gene promoter sequence and the binding site 
derived from lambda phage gene 10 as described in Example 7 was subjected to double 
digestion by restriction enzymes BsrGl and Ncol, and subcloned into BsrGl and Ncol sites of 
pTip-NHl and pTip-CHI, respectively. The resultant plasmids were named pTip-LNHl and 
pTip-LCHL 
[Example 9] 

Construction of vector plasmid pTip-NH2, pTip-CH2, pTip-LNH2, and pTip-LCH2 

The following procedure was performed to change the Ncol site located most upstream 
in the multicloning site to Ndel in the plasmids pTip-NHl, pTip-CHI, pTip-LNHl and 
pTip-LCHl described in Example 8 (Fig. 8). 

Amplification by PCR was performed using plasmid pHN170 as a template along with 
primers of SEQ ID Nos. 21 and 42 in the Sequence Listing. Consequently, DNA consisting 
of the TipA gene promoter was obtained. Double digestion of this DNA fragment of 0.2 kb 
was carried out with restriction enzymes BsrGl and Ndel, and the product was subcloned into 
BsrGl and Ndel sites of pHNl 70. The resultant plasmid was named pHNl 83. 

The synthetic oligodeoxyribonucleotides described in SEQ ID Nos. 43 and 44 in the 
Sequence Listing contain sequences to be multicloning sites and have complementary 
sequences to each other. These two were mixed in the same molar amount, processed at 
70''C for 10 minutes, and cooled to room temperature for converting them for 20 minutes into 
a double-strand. As a result, the end parts thereof became linkable to a vector obtained by 
double digestion by Ndel and Spel and this double-stranded synthetic DNA (designated as 
MCS Linker NNde in the drawing) was subcloned into Ndel and Spel sites of pHN183. The 
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resultant plasmid was named pTip-NH2. Furthermore, the synthetic 

oligodeoxyribonucleotides described in SEQ ID Nos. 45 and 46 in the Sequence Listing, 
which contain sequences to be multicloning sites and have complementary sequences to each 
other, were converted into a double-stranded synthetic DNA (designated as MCS Linker CNde 
in the drawing) was subcloned into Ndel and Spel sites of pHN183. The resultant plasmid 
was named pTip-CH2. 

Amplification by PCR was performed using plasmid pTip-LNHl as a template along 
with primers of SEQ ID Nos. 21 and 47 in the Sequence Listing. Consequently, a hybrid 
DNA consisting of the TipA gene promoter and the binding site derived from lambda phage 
gene 10 was obtained. Double digestion of this DNA fragment of 0.2 kb was carried out with 
restriction enzjones BsrGl and Ndel, and the product was subcloned into BsrGl and Ndel sites 
of pHN170 sites of pTip-NH2 and pTip-CH2, respectively. The resultant plasmids were 
named pTip-LNH2 and pTip-LCH2, respectively. 

The map of the plasmids created in Examples 8 and 9 and the sequence around the 
multicloning site are summarized in Fig. 9. In the drav^ng, the soUd arrow indicates an 
inverted repeat sequence which exists in the TipA gene promoter. The hatched arrow 
indicates an inverted repeat sequence which exists in the transcription termination sequence of 
ThcA gene. RBS, -10 region and -35 region generally present in the promoter domain of a 
procaryote and important for transcription of a gene, are boxed. In addition, the most 
important SD sequence (Shine and Dalgamo, Eur. J. Biochem. 57 221-230 (1975)) in RBS is 
imderlined. 
[Example 10] 

Construction of vector plasmid pTip-CHl.l, pTip-CH2.1, pTip-LCHl.l and pTip-LCH2.1 

The following procedure was performed in order to make the reading frame after the 

Xhol site of the multicloning site to agree with the reading frame of a commercial pET vector 

(Novagen) in the plasmids pTip-CHl, pTip-CH2, pTip-LCHl and pTip-LCH2 described in 

Examples 8 and 9 (Fig. 10). 

AmpUfication by PCR was performed using plasmid pTip-CHI as a template along 

with primers of SEQ ID Nos. 21 and 104 in the Sequence Listing. Consequently, DNA 
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containing the TipA gene promoter and a multicloning site was obtained. Double digestion of 
this DNA fragment of 0.3 kb was carried out with restriction enzymes BsrGl and Spel, and the 
product was subcloned into BsrGI and Spel sites of pTip-CHl. The resultant plasmid was 
named pTip-CHl . 1 . 

Amplification by PGR was performed using plasmid pTip-CH2 as a template along 
with primers of SEQ ID Nos. 21 and 104 in the Sequence Listing. Consequently, DNA 
containing the TipA gene promoter and a multicloning site was obtained. Double digestion of 
this DNA fragment of 0.3 kb was carried out with restriction enzymes BsrGl and Spel, and the 
product was subcloned into BsrGl and Spel sites of pTip-CHl. The resultant plasmid was 
named pTip-CH2. 1 . 

Amplification by PGR was performed using plasmid pTip-LGHl as a template along 
with primers of SEQ ID Nos. 21 and 104 in the Sequence Listing. Consequently, DNA 
containing the TipA-LGlO promoter and a multicloning site was obtained. Double digestion 
of this DNA fragment of 0.3 kb was carried out with restriction enzymes BsrGl and Spel, and 
the product was subcloned into BsrGl and Spel sites of pTip-GHl. The resultant plasmid 
was named pTip-LGHl . 1 . 

Amplification by PGR was performed using plasmid pTip-LGH2 as a template along 
with primers of SEQ ID Nos. 21 and 104 in the Sequence Listing. Consequently, DNA 
containing the TipA-LGlO promoter and a multicloning site was obtained. Double digestion 
of this DNA fragment of 0.3 kb was carried out with restriction enzymes BsrGl and Spel, and 
the product was subcloned into BsrGl and Spel sites of pTip-GHl. The resultant plasmid 
was named pTip-LCH2. 1 . 
[Example 11] 

Construction of vector plasmids pHN172 and pHN173 

In order to investigate whether the induction of expression is strictly regulated, the 
following plasmids for control experiment were created (Fig. 11). 

A DNA fragment of 1 .6 kb obtamed by double digestion of pHN169 by Xbal and Spel 
was subcloned into Xbal site of plasmid pHN144 (the direction of subcloning was tsr gene 
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ORF-tetracycline resistance gene ORF-ampicillin resistance gene ORF from 5' end of the 
DNA). The resuhant plasmid was named pHN 1 72. 

Next, a DNA fragment of 1.2 kb obtained by double digestion of pHN153 by ^^rGI 
and Xbal was subcloned into BsrGl and Spel sites of plasmid pHN144. The resultant 
plasmid was named pHN164. Subsequently, a DNA fragment of 1.6 kb obtained by double 
digestion of pHN164 by Xbal and Spel was subcloned into Xbal site of plasmid pHN164(the 
direction of subcloning was tsr gene ORF-tetracycline resistance gene ORF-ampicillin 
resistance gene ORF from 5' end of the DNA). The resultant plasmid was named pHN173. 

pHN170 has both of a gene cassette having three elements of the TipA gene promoter, 
PIP ORF linked downstream thereof and the ThcA transcription termination sequence fiirther 
downstream thereof (hereinafter referred to as expression cassette) and a gene cassette having 
two elements of the ThcA gene promoter and the TipA gene downstream thereof (hereinafter 
referred to as inducer cassette). pHN173 has only expression cassette and pHN172 does not 
have either of the cassettes. 
[Example 12] 

Transformation of a bacterium of genus Rhodococcus 

Rhodococcus erythropolis JCM 3201 strain is subjected to shaking culture in 100 ml of 
LB culture media at 30*^0 until the logarithmic growth phase. The culture solution is 
ice-cooled for 30 minutes and centrifuged to collect cell bodies. 100 ml of ice-cooled 
sterilized water is added to this, agitated well and centrifiiged again to collect cell bodies. 
100 ml of ice-cooled 10% glycerin solution is added to this, agitated well and centrifiiged to 
collect cell bodies. Washing with ice-cooled 10% glycerin solution is repeated once again, 
and the cell bodies are suspended in 5 ml of ice-cooled 10% glycerin solution. The solution 
is dispensed into 400 \x\ aliquots, instantly frozen by liquid nitrogen and it stored at -80^C until 
it is used. The cell bodies were taken out from -80°C, thawed on ice, and a plasmid of 
pHN170 or pHN172 or pHN173 was added in an amount of 3 \x\ (respectively about 300 ng). 
This mixed solution of cell bodies and DNA is transferred to an electroporation cuvette 
(Bio-Rad: 0.2cm gap cuvette) and electrically pulsed by using Gene Pulsar II, gene 
introduction equipment of the same company with the electric field of 12.5 kV/cm, under 
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setup conditions of the pulse controller at 25 |aF of capacitance and 400 Q of external 
resistance, respectively. The mixed solution of cell bodies and DNA subjected to electric 
pulsed processing was mixed to 1 ml of LB culture medium, and after cultured at 30°C for 4 
hours, the cell bodies were collected and applied to LB agar culture medium containing 20 
|ag/ml of tetracycline (agar concentration: 1.8%), and cultured at 30**C for 3 days to obtain 
each of the transformants. 
[Example 13] 

Measurement 1 of the PIP activity in a bacterium of genus Rhodococcus 

The constructed expression vector contains PIP gene as a reporter gene, and the 
inductivity by thiostrepton and the strength of induction etc. can be confirmed by measuring 
the enzyme activity of PIP. The amount of PIP which exists in the cell bodies can be easily 
quantified by observing the hydrolyzation activity on an artificial substrate H-Pro-P NA 
(product of Bachem). 

The transformant of Rhodococcus erythropolis JCM3201 strain created in Example 12 
is cultured at 30°C in an LB culture medium containing 8 ^ig/ml of tetracycline, and when the 
optical density (O.D.600) measured at the wavelength of 600 ran reached 0.6, thiostrepton 
(solvent is dimethylsulfoxide) was added so that the final concentration of thiostrepton might 
be 1 |ig/ml thereby inducing the expression of PIP. 

A part of culture solution is taken out in 16 hours, the volume is raised to total 200 \\\ 
by LB culture medium containing 8 ^ig/ml tetracycline and warmed at 60^C for 1 minute. 2 
|j.l of H-Pro-p NA (100 mM: solvent dimethylsulfoxide) is added thereto as a substrate of PIP 
and incubated at 60*^0 for 20 minutes (60'*C is optimal temperature of PIP). In order to 
observe free p NA hydrolyzed firom H-Pro-p NA by PIP, 134 |al of Fast Garnet GBC Salt 
solution (product of Wako Pure Chemicals Company; concentration: 0.5 mg/ml; solvent: 1 M 
acetic acid sodium buffer (pH 4.2) and 10% Triton X-100) is added as a coloring agent. The 
above-mentioned mixed solution exhibits yellow color when PIP is not expressed, while 
exhibiting red color when PIP is expressed. The exhibited red color was measvired by 
measuring absorbance at 550 nm (A550) using a spectrophotometer to quantify the PIP 
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activity. The measurement was performed by adding and diluting 666 of sterilized water 
after adding FastGamet GBC Salt. 

Since the optical density of the cell is also measured at 550 nm in this measurement, the 
optical density of the cell at 550 nm (O.D.550) is separately measured, and the value 
equivalent to O.D.550 used at the time of measurement is subtracted from the value of A550 
to obtain a corrected value Ac550. That is, Ac550 can be calculated by: 
Ac550 = A550 - O.D.550 x (amount (ml) of culture solution used for measuring PIP activity). 
The unit value was defined as "the value of Ac550 obtained by the measurement for 20 
minutes per 1 ml of culture solution per O.D. 600= 1" calculated by " Ac550 / (amoimt (ml) of 
culture solution used for measuring PIP activity) /O.D.600." 

The cells transformed by pHN170 exhibited yellow color when they were cultured 
without adding thiostrepton, whereas they exhibited red color when thiostrepton was added. 
As for the cells transformed by pHN172 and pHN173, they exhibited yellow color either in the 
presence or absence of thiostrepton. 

The transformant of Rhodococcus erythropolis JCM3201 strain created in Example 12 
is cultured at 30**C in an LB culture medium containing 8 (ig/ml of tetracycline, and when 
O.D.600 reached 2.0, the mediimi was cooled to a temperature of 4*^0 immediately and 
cultured with shaking for one hour in order to make cell bodies naturalized. Thiostrepton 
was added so that the final concentration thereof might be 1 |ag/ml to induce the expression of 
PIP. A part of culture solution was taken out in 40 hours, and the same experiment as the 
above-mentioned was conducted at 30°C. 

The cells transformed by pHNlTO exhibited yellow color when they were cultured 
without adding thiostrepton, whereas they exhibited red color when thiostrepton was added. 
As for the cells transformed by pHN172 and pHN173, they exhibited yellow color either in the 
presence or absence of thiostrepton. 

The above result is summarized in Fig. 12. 

As shown in Fig. 12, Rhodococcus erythropolis JCM3201 strain was transformed by 
pHN170, pHN172 and pHN173, and PIP activity was measured respectively for the cases 
where PIP is expressed or not expressed at 30^C or 4®C. Whether the thiostrepton is added 
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(+) or not (-) with a final concentration of 1 ^ig/ml, activity value, culturing temperature, 
amount of culturing solution, transformed plasmid and whether the plasmid has "cassette" (+) 
or not (-) are shown in Fig. 12. 

It was confirmed from these results that the expression vector which can be induced by 
thiostrepton in the wide temperature range had been constructed. 
[Example 14] 

Measurement 2 of the PIP activity in a bacterium of genus Rhodococcus 

The PIP activity of the cells transformed from Rhodococcus erythropolis JCM3201 
strain by pHN170 and pHN171 created in Example 12 were measured in the same manner as 
in Example 13. 

The results of measuring PIP activity with time after adding thiostrepton with a final 
concentration of 1 fxg/ml are shown in Fig. 13. This figure shows the activity with time of 
Rhodococcus erythropolis JCM3201 strain transformed by pHN170, and made to express PIP 
at 30°C and 4®C. In Fig. 13, the vertical axis indicates the activity value (unit) of PIP and the 
horizontal axis indicates the time after thiostrepton with a final concentration of 1 |ag/ml is 
added. The symbol "O" at 4^*0 shows activity when O.D.600 is 1.0 and an expression 
induction is started, and the symbol "□" shows activity when O.D.600 is 2.0 and an expression 
induction is started. The symbol "O" at 30®C shows activity when O.D.600 is 0.6 and an 
expression induction is started, and the symbol shows activity when O.D.600 is 1.0 and an 
expression induction is started. 

Fig. 14 shows the results in the case where the final concentration of thiostrepton added 
was varied. In this case, induction expression time was 2400 minutes (40 hours) in the case 
where induction was started at 4°C when O.D.600=2.0, and 960 minutes (16 hours) in the case 
where induction was started at 30'^C when O.D.600= 0.6. Examples shown in Fig. 14 show 
the PIP activity of the cells transformed from Rhodococcus erythropolis JCM3201 strain by 
pHN170 measured at various concentrations of thiostrepton added to express PIP at 30°C and 
4°C. In Fig. 14, the vertical axis indicates the activity value (unit) of PIP and the horizontal 
axis indicates the final concentration (|Lig/ml) of thiostrepton added in the culture medium. 
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These results have revealed that 1 iig/ml of thiostrepton is enough for expression 
induction either at 30®C or 4®C. Moreover, although depending on the timing of expression 
induction, the amount of expression of PIP per cell reaches the maximum for 500 to 1000 
minute (about 8 to 16 hours) as for the case of 30**C, and 3000 minutes (50 hours) or more as 
for the case of 4®C. 
[Example 15] 

Measurement 3 of the PIP activity in a bacterium of genus Rhodococcus 

Rhodococcus erythropolis JCM3201 strain, Rhodococcus fascians JCM 10002 strain 
and Rhodococcus opacus DSM44193 strain are transformed by pHN170 in the same manner 
as in Example 12. Consequently, the transformation not only of Rhodococcus erythropolis 
but also Rhodococcus fascians and Rhodococcus opacus was able to be carried out by 
pHN170. Therefore, it was shown that the autonomous replication starting point derived 
from Rhodococcus erythropolis JCM2895 strain introduced into pHN170 functions also in 
Rhodococcus fascians and Rhodococcus opacus. The PIP activity was measured using these 
transformants in the same manner as in Example 13. In each of the strains, induction 
expression time was 2400 minutes (40 hours) in the case where induction was started at 4''C 
when O.D.600=2.0, and 960 minutes (16 hours) in the case where induction was started at 
SO^'C when O.D.600= 0.6. The results are shown in Fig. 15. In Fig. 1 5, the PIP activity was 
measured for the cells transformed from Rhodococcus erythropolis JCM3201 strain, 
Rhodococcus fascians JCM 10002 strain and Rhodococcus opacus DSM44193 strain by 
pHN170 and for each case where PIP is expressed or not expressed at 30®C or 4°C. 
Whether the thiostrepton is added (+) or not (-) with a final concentration of 1 iiig/ml, activity 
value, culturing temperature, amount of culturing solution, transformed plasmid and the hosts 
transformed by pHN170 are shown in Fig. 15. 

Every bacterium of genus Rhodococcus transformed by pHN170 exhibited yellow color 
when they were cultured without adding thiostrepton, whereas they exhibited red color when 
thiostrepton was added. However, in Rhodococcus fascians JCM 10002 strain and 
Rhodococcus opacus DSM44193 strain, expression was low compared with Rhodococcus 
erythropolis JCM320\ strain. 
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[Example 16] 

Expression and purification 1 of a foreign protein in a bacterium of genus Rhodococcus 

The transformation of Rhodococcus erythropolis JCM3201 strain was carried out in the 
same manner as in Example 12, and PIP was made to express at 30°C and 4°C in the similar 
manner as in Example 13 using pHN170 (described in Example 7) and pHN171 (described in 
Example 7), respectively. Here, after adding thiostrepton with a final concentration of 1 
^g/ml, cell bodies were collected with time and PIP was purified. The PIP has 6xHis tag 
linked at C-terminal thereof, and purified using nickel-NTA Superflow (product of Qiagen) 
following the manufacturer's instructions. 

The purification process was specifically described below, where the procedure of 
purification was performed at 4**C. The cell bodies which expressed protein (a part for 20 ml 
culture solution) were collected, and suspended into 1 ml of NT-Buffer (50 mM Tris-HCl (pH 
8.0), 100 mM sodium chloride, 1 mM dithiothreitol), and 1 g of glass beads (diameter of 0.105 
- 0.125 mm) was added. The cells were destroyed by reciprocally shaking by Fast-prep 
FP120 (product of SAVANT) with 6 m/second in speed for 20 seconds. After centrifuged in 
20,000xg, nickel-NTA Superflow which was beforehand equilibrated with NT-Buffer was 
added to 700 |i.l of the supematant so that the bed volume might be 40 fxL This mixture was 
subjected to rotation agitation for 1 hour to combine the nickel-NTA Superflow beads to the 
protein attached with 6xHis tag. After washing these beads 4 times by NT-Buffer, the 
protein was eluted from the protein attached with 6xHis tag by suspending these beads in 120 
Hl of NTE-Buffer (50 mM Tris-HCl (pH 7.0), 100 mM sodium chloride, 1 mM dithiothreitol, 
400 mM imidazole) 3 times. 

The results where 10 (il of the above-mentioned samples was subjected to 12% of SDS 
polyacrylamide electrophoresis according to an ordinary method are shown in Fig. 16. The 
transformation of Rhodococcus erythropolis JCM3201 strain was carried out by pHN170 
(expression firom the TipA gene promoter: two panels on the left), and pHN171 (expression 
from the TipA-LGlO promoter: two panels on the right), and PIP was made to express at 4'*C 
(the upper two panels) and 30®C (lower two panels). After adding thiostrepton with the final 
concentration of 1 |Lig/ml, cell bodies were collected with time and purified using nickel-NTA 
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Superflow using the 6xHis tag attached to C-terminal of PIP. The timing for collecting cell 
bodies is set 0 minute (leftmost lane), 180 minutes (second lane from the left), 420 minutes 
(third lane from the left), 1080 minutes (fourth lane from the left), 1440 minutes (fifth lane 
from the left), 1860 minutes (sixth lane from the left), 2520 minutes (seventh lane from the 
left) and 3060 minutes (eight lane from the left) at 4°C and 0 minute (leftmost lane), 120 
minutes (second lane from the left). They are 240 minutes (third lane from the left), 420 
minutes (fourth lane from the left), 540 minutes (fifth lane from the left), 720 minutes (sixth 
lane from the left), 900 minutes (seventh lane from the left) and 1440 minutes (eight lane from 
the left) at 30®C. In every panel in Fig. 16, the rightmost lane shows the sample purified 
from cell bodies which continued culturing without inducing (namely, without adding 
thiostrepton). At BO^'C, expression from the TipA-LGlO promoter was a little lower 
compared with the expression from the TipA gene promoter, whereas at 4*^C, expression from 
the TipA-LGlO promoter was conversely higher. In addition, the induction of expression was 
controlled strictly also in TipA-LGlO promoter. 

Detailed comparison in the amoimt of expression from both the promoters are 
described in Example 18 in detail. 
[Example 17] 

Expression and purification 2 of a foreign protein in a bacterium of genus Rhodococcus 

The transformation of Rhodococcus erythropolis JCM3201 strain was carried out in the 
same manner as in Example 12, and PIP was made to express in the similar manner as in 
Example 13 using pHN170 (described in Example 7), and pHN171 (described in Example 7), 
respectively at 32''C, BO'^C, 15°C, and 4°C. Induction expression time was 2400 minutes (40 
hours) in the case where induction was started at 4®C when O.D.600=2.0, 1500 minutes (25 
hours) in the case where induction was started at 15°C when O.D.600=L0, 960 minutes (16 
hours) in the case where induction was started at 30°C when O.D.600=0.6, and 960 minutes 
(16 hours) in the case where induction was started at 32®C when O.D.600= 0.6. The 
thiostrepton added is in an amoxmt of 1 fig/ml by the final concentration. Purification was 
performed in the same manner as Example 16. 
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The results where 10 jal of the above-mentioned samples was subjected to 12% of SDS 
polyacrylamide electrophoresis according to an ordinary method are shown in Fig. 17. The 
transformation of Rhodococcus erythropolis JCM3201 strain was carried out by pHN170 
(expression from the TipA gene promoter: lanes 1, 3, 5 and 7), and pHN171 (expression from 
the TipA'LGlO promoter: lanes 2, 4, 6 and 8), and PIP was made to express at 4°C (lanes 7 
and 8), 1 5*^C (lanes 5 and 6), 30°C (lanes 3 and 4) and 32°C (lanes 1 and 2). Purification was 
performed using nickel-NTA Superflow utilizing the 6xHis tag linked to C-terminal of PIP. 

In the wide temperature range from 32 to 4°C, expression of PIP from the TipA gene 
promoter and TipA-LGlO promoter was confirmed. The amoxmts of expression of PIP were 
higher from the TipA promoter at 32°C and 30**C, whereas the amounts of expression were 
higher from TipA-LGlO promoter at 15X and 4**C. 
[Example 18] 

Expression and purification 3 of foreign proteins in a bacterium of genus Rhodococcus 

The following experiments were conducted in order to investigate whether a protein 
other than PIP can also be made to express using this expression vector. 

Amplification by PGR was performed using plasmid pRSET-ATPIP as a template 
along with primers of SEQ ID Nos. 48 and 49 in the Sequence Listing. Consequently, DNA 
containing PIP gene derived from Arabidopsis thaliana (Tamura et al., FEBS Lett. 398 
101-105 (1996) — hereinafter abbreviated as AtPIP) was obtained. Double digestion of this 
DNA fragment of 1 .0 kb was carried out with restriction enzymes Ncol and Xhol, and the 
product was subcloned into Ncol and Xhol sites of pTip-CHl and pTip-LCHl. As a result, 
plasmids containing AtPIP gene under control of the TipA gene promoter TipA-LGIO promoter 
were created, and the resultant plasmids were named pHN176 and pHN177, respectively. 

Amplification by PGR was performed using plasmid pTrc99a-GFP as a template along 
with primers of SEQ ID Nos. 50 and 51 in the Sequence Listing. Consequently, DNA 
containing a gene encoding fluorescence green protein derived from Aequorea victoria 
(hereinafter abbreviated as GFP) was obtained. Double digestion of this DNA fragment of 
0.8 kb was carried out with restriction enzymes Ncol and SndRl^ and the product was 
subcloned into Ncol and SndBl sites of pTip-NHl and pTip-LNHl. As a result, plasmids 
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containing GFP (having a 6xHis tag linked to N-terminal) gene under control of the TipA gene 
promoter or TipA-LGIO promoter were created, and the resultant plasmids were named 
pHN187 and pHN186, respectively. 

Amplification by PGR was performed using plasmid pGEX-2T (Amersham Bioscience 
Company) as a template along with primers of SEQ ID Nos. 52 and 53 in the Sequence Listing. 
Consequently, DNA containing a gene encoding glutathione-S-transferase protein (hereinafter 
abbreviated as GST) was obtained. Double digestion of this DNA fragment of 0.7 kb was 
carried out with restriction enzymes Ncol and Xhol, and the product was subcloned into Ncol 
and JOiol sites of pTip-NH2 and pTip-LNH2. As a result, plasmids containing GST (having a 
6xHis tag linked to N-terminal) gene under control of the TipA gene promoter or TipA-LGlO 
promoter were created, and the resultant plasmids were named pHN282 and pHN283, 
respectively. 

The transformation of Rhodococcus erythropolis JCM3201 strain was carried out in the 
same manner as in Example 12 using pHN170 (described in Example 7), pHN171 (described 
in Example 7), pHN176, pHN177, pHN187, pHN186, pHN282 and pHN283, and each of the 
proteins PIP, AtPIP, GFP, and GST was made to express in the similar manner as in Example 
13 respectively at 30*'C and 4°C. Induction expression time was 2400 minutes (40 hours) in 
the case where induction was started at 4°C when O.D.600=2.0, 960 minutes (16 hours) in the 
case where induction was started at 30°C when O.D. 600=0.6. The thiostrepton added is in an 
amount of 1 |ag/ml by the final concentration. Purification was performed from 50 ml of 
culture solution at 4°C and firom 20 ml of culture solution at 30®C. 

All of the four above-mentioned proteins have a 6xHis tag and purification was 
performed in the same manner as Example 16. 

The results where 10 |xl of the above-mentioned samples was subjected to 12% of SDS 
polyacrylamide electrophoresis according to an ordinary method are shown in Fig. 18. The 
transformation of Rhodococcus erythropolis JCM3201 strain was carried out by pHN170 (JPIP 
located downstream of the TipA gene promoter: lanes 1 and 9), pHN171 {PIP located 
downstream of the TipA-LGlO promoter: lanes 2 and 10), pHN176 (AtPIP located downstream 
of the TipA gene promoter: lanes 3 and 11), pHN177 (AtPIP located downstream of the 
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TipA'LGlO promoter: lanes 4 and 12), pHN187 (GFP located downstream of the TipA gene 
promoter: lanes 5 and 13), pHN186 (GFP located downstream of the TipA-LGlO promoter: 
lanes 6 and 14), pHN282 (GST located downstream of the TipA gene promoter: lanes 7 and 
15) and pHN283 (GST located downstream of the TipA-LGlO promoter: lanes 8 and 16) and 
the proteins were made to express at A^C (lanes 9 to 16) and SO^'C (lanes 1 to 8). Purification 
was performed using nickel-NTA Superflow utilizing the 6xHis tag linked to an end of each 
protein. 

The results by measuring and quantifying the density of each band by densitometer are 
also shown in Fig. 19. These were quantified firom the bands of SDS polyacrylamide 
electrophoresis shown in Fig, 18, In this figure, purified amoimt for each foreign protein 
fi-om 1 liter of the culture solution is shown. The unit is shown by mg. The rightmost 
column (magnification) indicates that how many times of protein is purified when the protein 
is expressed using the TipA-LGlO promoter compared with the case where the protein is 
expressed using the TipA gene promoter. Consequently, it tums out that that the amount of 
the recombinant protein obtaiined at 4°C was more when expressed firom the TipA-LGlO 
promoter than fi"om TipA gene promoter. However, in the case of 30®C, the amoimt of the 
recombinant protein obtained was not necessarily more when expressed fi-om the TipA-LGlO 
promoter than from TipA gene promoter. 
[Example 19] 

Isolation of the protein derived from mouse which exhibits the proliferation inhibiting effect to 
E, coli at 30'=*C 

In order to investigate which gene exhibits the proliferation inhibiting effect to a host 
when expressed specifically, an expression library for E. coli was constructed using 
Poly(A)^A (product of STRATAGENE) derived from mouse liver. Details are 
specifically described below. 

An arabinose inducible vector was adopted for use in the expression vector for E, coli. 
First, in order to facilitate introduction of cDNA into this vector, pBAD/HisA (product of 
Invitrogen), pBAD-Linker which has a modified multicloning site was created. The creation 
process is described below. Synthetic oligodeoxyribonucleotide described in SEQ ID Nos, 
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54 and 55 in the Sequence Listing which contain a cloning sequences consisting of EcoKl, 
Bglll and Xhol recognition sites and have complementary sequence to each other are mixed in 
an equivalent molar amount, processes at 70''C for 10 minutes, cooled down to room 
temperature for 20 minutes to form a double strand. As a result, the end parts thereof became 
linkable to a vector obtained by double digestion by Ncol and Hindlll and was subcloned into 
Ncol and Hindlll sites of pB AD/HisA. The resultant plasmid was named pBAD-Linker. 

A double stranded cDNA was synthesized from the above-mentioned Poly(A)"^RNA 
using cDNA synthesis kit available from STRATAGENE following the manufacturer's 
instructions. Subsequently, this cDNA was ligated into EcoRl and Xhol sites of 
pBAD-Linker. According to an ordinary method, this ligation product was transformed into 
E. coli TOP 10 (product of Invitrogen), and 50,000 transformants were acquired on LB agar 
culture medium containing 50 ng/ml ampicillin. A replica of this agar culture medium was 
created on LB agar culture medium which contained 50 |ig/ml of ampicillin £uid 0.2% of 
L-arabinose by using Easy Transfer Replica Plating Device available from GenHunter, protein 
expression was induced, and incubated at 30°C ovemight. Consequently, 426 colonies which 
was able to grow on the culture medium not containing arabinose and was not able to grow on 
the cxilture medium containing arabinose were sorted out. 

After culturing this 426 TOP 10 transformant in 1.5 ml of LB culture medium 
containing 50 )ag/ml ampicillin, plasmids were separated and purified following an ordinary 
method. After double digestion of restriction enzymes EcoRl and Xhol, the obtained 
plasmids were subjected to 1% agarose gel electrophoresis and the length of the cDNA 
fragment derived from mouse was estimated. Furthermore, from the obtained plasmid, the 
nucleotide sequence of the cDNA fragment derived from mouse was determined by about 500 
bases using a synthetic oligodeoxyribonucleotides described in SEQ ID No. 56 in the 
Sequence Listing and a DNA sequencer ABI PRISM (R) 3100 Genetic Analyzer, The results 
are shown in Fig. 20. This figure shows the results using BLAST program, performing 
homology search based on the determined DNA sequence to identify the gene. 
[Example 20] 
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Expression and purification of foreign proteins in a bacterium of genus Rhodococcus and E, 
coli 

Among the genes isolated in Example 19, Serum amyloid A (Saal), NADH 
dehydorogenase 1 alpha subcomplex 4, and Cytochrome b5 like, RIKEN1500015G18, 
Transferrin, Apolipoprotein A-V, Pantotenate kinase ip, Peroxiredoxin 4 and 
RIKENl 30001 7J02 (Transferrin Homolog) were expressed using Rhodococcus erythropolis 
JCM3201 and E, coli TOP 10 as hosts. The following four groups and ten kinds of protein 
were expressed similarly. 

1) Group 1: Three kinds of protease which is known to become an inactive inclusion body if 
expressed in E. colU Cathepsin D, Prothrombin, Kallikrein 6; 

2) Group 2: Two kinds of DNAses, LSDNAse and DL AD whose expression in E. coli is 
expected to be difScult from their physiology activity; and 

3) Group 3: Those of which expression is considered to be difficult in E, coli according to the 
researches of other groups due to the cell proliferation inhibiting activity, HMG-1, Kidl, Bax 
alpha; and 

4) Group 4: Those which are solubilized with low temperature dependence according to the 
researches of other groups, Glucokinase and p37A. 

The recombinant proteins were expressed at 30°C and A^C in Rhodococcus 
erythropolis and at 30°C in E. coli, respectively. Details are described below. 

Amplification by PCR was performed using plasmid LE20 as a template along with 
primers of SEQ ID Nos. 57 and 58 in the Sequence Listing. Consequently, DNA containing 
Saal gene (GenBank Accession Number Ml 1131) encoding Serum Amyloid Protein A 
derived from mouse (Meeker et al.. Proteins 30 381-387 (1998)) was obtained. Double 
digestion of this DNA fragment was carried out with restriction enzymes Ndel and Xhol, and 
the product was subcloned into Ndel and Xhol sites of pTip-LNHl. As a result, a plasmid 
containing SaaJ gene under control of the TipA-LGlO promoter was created, and the resultant 
plasmid was named pHN205. In addition, amplification by PCR was performed using 
plasmid LE20 as a template along with primers of SEQ ID Nos. 59 and 60 in the Sequence 
Listing. Consequently, DNA containing Saal gene derived from mouse was obtained. 
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Double digestion of this DNA fragment was carried out with restriction enzymes Xhol and 
Kpnl, and the product w£is subcloned into Xhol and Kpnl sites of pBAD/HisA. As a result, a 
plasmid containing Saal gene under control of the arabinose inducible promoter was created, 
and the resultant plasmid was named pHN193. 

Amplification by PGR was performed using plasmid LI 13 as a template along with 
primers of SEQ ID Nos. 61 and 62 in the Sequence Listing. Consequently, DNA containing 
a gene encoding NADH dehydrogenase 1 alpha subcomplex 4 derived from mouse (Walker et 
al., J. MoL BioL 226 1051-1072 (1992): GenBank Accession Number BC011114; hereinafter 
abbreviated as NADH4) was obtained. Double digestion of this DNA fiiagment was carried 
out with restriction enzymes Ndel and EcoRl^ and the product was subcloned into Ndel and 
EcoRl sites of pTip-LNHl. As a result, a plasmid containing NADH4 gene under control of 
the TipA-LGlO promoter was created, and the resultant plasmid was named pHN206. In 
addition, amplification by PGR was performed using plasmid LI 13 as a template along with 
primers of SEQ ID Nos. 63 and 62 in the Sequence Listing. Gonsequently, DNA containing 
NADH4 gene derived from mouse was obtained. Double digestion of this DNA fragment 
w£is carried out with restriction enzymes Xhol and £coRI, and the product was subcloned into 
Xhol and EcoRl sites of pBAD/HisA. As a result, a plasmid containing NADH4 gene under 
control of the arabinose inducible promoter was created, and the resultant plasmid w£is named 
pHN195. 

Amplification by PGR was performed using plasmid L3 as a template along with 
primers of SEQ ID Nos. 64 and 65 in the Sequence Listing. Gonsequently, DNA containing 
a gene encoding Gytochrome b5 like protein derived from mouse (GenBank Accession 
Number AK002426: hereinafter abbreviated as Cytochrome bSt) was obtained. Double 
digestion of this DNA fragment was carried out with restriction enzymes Ndel and £coRI, and 
the product was subcloned into Ndel and £coRI sites of pTip-LNHl. As a result, a plasmid 
containing Cytochrome b5l gene under control of the TipA-LGlO promoter was created, and 
the resultant plasmid was named pHN208. In addition, amplification by PGR was performed 
using plasmid L3 as a template along with primers of SEQ ID Nos. 66 and 65 in the Sequence 
Listing. Gonsequently, DNA containing Cytochrome b5l gene derived from mouse was 
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obtained. Double digestion of this DNA fragment was carried out with restriction enzymes 
Xhol and EcoRI, and the product was subcloned into Xhol and EcoRI sites of pBAD/HisA. 
As a result, a plasmid containing Cytochrome b5l gene under control of the eirabinose 
inducible promoter was created, and the resultant plasmid was named pHN199. 

Amplification by PGR was performed using plasmid LEI 23 as a template along with 
primers of SEQ ID Nos. 67 and 68 in the Sequence Listing. Consequently, DNA containing 
a gene encoding a putative protein derived from mouse whose function was unknown 
(GenBank Accession Number NM#025439: hereinafter abbreviated as LEI 23) was obtained. 
Double digestion of this DNA fragment was carried out with restriction enzymes Ndel and 
EcoRL, and the product was subcloned into Ndel and EcoRI sites of pTip-LNHl . As a result, 
a plasmid containing LEI 23 gene under control of the TipA-LGlO promoter was created, and 
the resultant plasmid was named pHN287. In addition, amplification by PGR was performed 
using plasmid LEI 23 as a template along with primers of SEQ ID Nos. 69 and 68 in the 
Sequence Listing. Consequently, DNA containing LEI 23 gene derived from mouse was 
obtained. Double digestion of this DNA fragment w£is carried out with restriction enzymes 
Xhol and EcoRI, and the product was subcloned into Xhol and EcoRI sites of pBAD/HisA. 
As a result, a plasmid containing LE123 gene imder control of the arabinose inducible 
promoter was created, and the resultant pleismid was named pHN276. 

Amplification by PGR was performed using plasmid LE280 as a template along with 
primers of SEQ ID Nos. 70 and 71 in the Sequence Listing. Consequently, DNA containing 
a gene encoding Transferrin derived from mouse (Mason et al.. Protein Expr. Purif. 23 
142-150 (2001); GenBank Accession Number BG022986) was obtained. Double digestion 
of this DNA fragment was carried out with restriction enzymes Ndel and Hmdlll, and the 
product was subcloned into Ndel and Hindlll sites of pTip-LNHl. As a result, a plasmid 
containing Transferrin gene under control of the TipA-LGlO promoter was created, and the 
resultant plasmid was named pHN289. In addition, amplification by PGR was performed 
using plasmid LE280 as a template along with primers of SEQ ID Nos. 72 and 71 in the 
Sequence Listing. Consequently, DNA containing Transferrin gene derived from mouse was 
obtained. Double digestion of this DNA fragment was carried out with restriction enzymes 
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Xhol and HindlU^ and the product was subcloned into Xhol and Hindlll sites of pBAD/HisA. 
As a result, a plasmid containing Transferrin gene under control of the arabinose inducible 
promoter was created, £ind the resultant plasmid was named pHN277. 

Amplification by PGR was performed using plasmid LE295 as a template along with 
primers of SEQ ID Nos. 73 and 74 in the Sequence Listing. Consequently, DNA containing 
a gene encoding Apolipoprotein A-V derived from mouse (van der Vliet et al., J. Biol. Chem. 
276 44512-44520 (2001); GenBank Accession Number NM#080434; hereinafter referred to as 
ApoaS) was obtained. Double digestion of this DNA fragment was carried out with 
restriction enzymes Ncol and £coRI, and the product was subcloned into Ncol and EcdRl sites 
of pTip-LNH2. As a result, a plasmid containing ApoaS gene under control of the 
TipA'LGlO promoter was created, and the resultant plasmid was named pHN288. In addition, 
amplification by PGR was performed using plasmid LE295 as a template along with primers 
of SEQ ID Nos. 75 and 74 in the Sequence Listing. Consequently, DNA containing ApoaS 
gene derived from mouse was obtained. Double digestion of this DNA fragment was carried 
out with restriction en2ymes Xhol and £coRI, and the product was subcloned into Xhol and 
^coRI sites of pBAD/HisA. As a result, a plasmid containing ApoaS gene under control of 
the arabinose inducible promoter was created, and the resultant plasmid was named pHN281 . 

Amplification by RT-PCR (Larrick, Trends Biotechnol. 10 146-152 (1992) was 
performed using mouse liver Poiy(A)'^RNA along with primers of SEQ ID Nos. 76 and 77 in 
the Sequence Listmg. RT-PCR was performed by using ProSTAR Ultra HF RT-PCR System 
available from STRATAGENE following the manufacturer's instructions (hereinafter all 
RT-PCR was performed using this kit below). Consequently, DNA containing Cathepsin D 
gene derived from mouse (Grusby et al.. Nucleic Acids Res. \% 4008 (1990); Babe et al.. 
Biotechnology and Genetic Engineering Reviews 17 213-252:(2000); GenBank Accession 
Number X52886) was obtained. Double digestion of this DNA fragment was carried out 
with restriction enzymes Ncol and Xhol^ and the product was subcloned into Ncol and Xhol 
sites of pTip-LCHl . As a result, a plasmid containing Cathepsin D gene under control of the 
TipA-LGlO promoter was created, and the resultant plasmid was named pHN270. 
Furthermore, a DNA fragment of 1 .2 kb obtained by double digestion of pHN270 by Ncol and 
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San was subcloned into Ncol and Xhol sites of pBAD/HisA. The resultant plasmid was 
named pHN273. 

Amplification by RT-PCR was perfomied using mouse liver Poly(A)^RNA along with 
primers of SEQ ID Nos. 78 and 79 in the Sequence Listing. Consequently, DNA containing 
Prothrombin gene derived from mouse (Degen et al., DNA Cell Biol.9 487-498 :( 1990); 
GenBank Accession Nimiber X52308) was obtained. Double digestion of this DNA 
fragment was carried out with restriction enzymes Ncol and Xhol, and the product was 
subcloned into Ncol and Xhol sites of pTip-LCHl. As a result, a plasmid containing 
Prothrombin gene under control of the TipA-LGlO promoter was created, and the resultant 
plasmid was named pHN271 . 

Amplification by RT-PCR was performed using mouse liver Poly(A)^RNA along with 
primers of SEQ ID Nos. 80 and 81 in the Sequence Listing. Consequently, DNA containing 
Kallikrein 6 gene derived from mouse (Evans et al., J.Biol. Chem.262 8027-8034 (1987); Babe 
et al., Biotechnology and Genetic Engineering Reviews 17 213-252:(2000); GenBank 
Accession Number NM#010639) was obtained. Double digestion of this DNA fragment was 
carried out with restriction enzymes Ncol and Xhol, and the product was subcloned into Ncol 
and Xhol sites of pTip-LCHl. As a result, a plasmid containing Kallikrein 6 gene under 
control of the TipA-LGlO promoter was created, and the resultant plasmid was named pHN272. 
Furthermore, a DNA fragment of 0.7 kb obtained by double digestion of pHN272 by Ncol and 
Sail was subcloned into Ncol and Xhol sites of pBAD/HisA. The resultant plasmid was 
named pHN275. 

Amplification by RT-PCR was performed using mouse liver Poly(A)^RNA along with 
primers of SEQ ID Nos. 82 and 83 in the Sequence Listing. Consequently, DNA containing 
LSDNAse gene derived from mouse (Baron et al., Gene 215 291-301:(1998); GenBank 
Accession Number AF 047355) was obtained. Double digestion of this DNA fragment was 
carried out with restriction enzymes Ndel and Xhol, and the product was subcloned into Ndel 
and Xhol sites of pTip-LNHl . As a result, a plasmid containing LSDNAse gene under control 
of the TipA-LGlO promoter was created, and the resultant pleismid w£is named pHN299. 
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Amplification by RT-PCR was performed using mouse liver Poly(A)^RNA along with 
primers of SEQ ID Nos. 84 and 85 in the Sequence Listing. Consequently, DNA containing 
DLAD gene derived from mouse (Shiokawa and Tanuma, Nucleic Acids Res.27 
4083-4089:(1999); GenBank Accession Number AF 128888) was obtained. Double 
digestion of this DNA fragment was carried out with restriction enzymes Ncol and £coRI, and 
the product was subcloned into Ncol and EcdRi sites of pTip-LNH2. As a resuh, a plasmid 
containing DLAD gene under control of the TipA-LGlO promoter was created, and the 
resultant plasmid was named pHN284. 

Amplification by RT-PCR was performed using mouse liver Poly(A)^RNA along with 
primers of SEQ ID Nos. 86 and 87 in the Sequence Listing. Consequently, DNA containing 
HMG-1 gene derived from mouse (Pauken et al., Mancmi.Genome 5 91-99 (1994); Lee et al.. 
Gene 225 97.105:(1998); GenBank Accession Number U00431) was obtained. Double 
digestion of this DNA fragment was carried out with restriction enzymes Ncol and Eco'Rl, and 
the product was subcloned into Ncol and EcoKl sites of pTip-LNH2. As a result, a plasmid 
containing HMG-1 gene under control of the TipA-LGlO promoter was created, and the 
resultant plasmid was named pHN285. In addition, amplification by PGR was performed 
using plasmid pHN285 as a template along with primers of SEQ ID Nos. 88 and 87 in the 
Sequence Listing. Consequently, DNA containing HMG-l gene derived from mouse was 
obtained. Double digestion of this DNA fragment was carried out with restriction enzymes 
Ncol and EcoRl^ and the product was subcloned into Ncol and EcoRl sites of pBAD/HisA. 
As a result, a plasmid containing HMG-l gene xmder control of the arabinose inducible 
promoter was created, and the resultant plasmid was named pHN305. 

Amplification by RT-PCR was performed using mouse liver Poly(A)''^NA along with 
primers of SEQ ID Nos. 89 and 90 in the Sequence Listing. Consequently, DNA containing 
Kidl gene derived from mouse (Tekki-Kessaris et al., Gene 240 13-22 (1999); 
Suter-Crazzolara and Unsicker Bio/Technology 19 202-204:(1995); GenBank Accession 
Number AF 1841 1 1) was obtained. Double digestion of this DNA fragment was carried out 
with restriction enzymes Ncol and Hindlll, and the product was subcloned into Ncol and 
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Hindlll sites of pTip-LNH2. As a result, a plasmid containing Kidl gene under control of the 
TipA-LGlO promoter was created, and the resultant plasmid was named pHN286. 

Amplification by RT-PCR was performed using mouse liver Poly(A)'^RNA along with 
primers of SEQ ID Nos. 91 and 92 in the Sequence Listing. Consequently, DNA containing 
Bcoc alpha gene derived from mouse (Oltvai et al.. Cell 74 609-619 (1993); Donnelly et al.. 
Protein Expr.Purif.22 422-429:(2001); GenBank Accession Number L22472) was obtained. 
Double digestion of this DNA fragment was carried out with restriction enzymes Ndel and 
£coRJ, and the product was subcloned into Ndel and EcoRl sites of pTip-LNHl. As a result, 
a plEismid containing Box alpha gene imder control of the TipA-LGlO promoter was created, 
and the resultant plasmid was named pHN217. In addition, amplification by PCR was 
performed using plasmid pHN217 as a template along with primers of SEQ ID Nos. 93 and 92 
in the Sequence Listing. Consequently, DNA containing Bax alpha gene derived from 
mouse was obtained. Double digestion of this DNA fragment was carried out v^th restriction 
enzymes Xhol and £coRI, and the product was subcloned into Xhol and EcoKL sites of 
pBAD/HisA. As a result, a plasmid containing Bax alpha gene under control of the 
arabinose inducible promoter was created, and the resultant plasmid was named pHN212. 

Amplification by RT-PCR was performed using mouse liver Poly(A)^TlNA along with 
primers of SEQ ID Nos. 94 and 95 in the Sequence Listing. Consequently, DNA containing 
Glucokinase gene derived from mouse (Lin et al., Protein Expr.Purif.i 169-176(1990); 
GenBank Accession Number BC011139) was obtained. Double digestion of this DNA 
fragment was carried out with restriction enzymes Ndel and Xhol^ and the product was 
subcloned into Ndel and Xhol sites of pTip-LNHl. As a result, a plasmid containing 
Glucokinase gene under control of the TipA-LGlO promoter was created, and the resultant 
plasmid was named pHN298. Furthermore, a DNA fragment of 1.4 kb obtained by double 
digestion of pHN298 by Ncol and Xhol was subcloned into Ncol and Xhol sites of 
pBAD/HisA. The resultant plasmid was named pHN306. 

Amplification by PCR was performed using plasmid pET22 b-Dmp37A as a template 
along with primers of SEQ ID Nos. 105 and 96 in the Sequence Listing. Consequently, DNA 
containing a gene encoding p37A (Holzl et al., J.Cell Biol.150 119-129:(2000); GenBank 
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Accession Number AF 1453 12) derived from Drosophila melanogaster was obtained. 
Double digestion of this DNA fragment was carried out with restriction enzymes Ndel and 
Xhol^ and the product was subcloned into Ndel and Xho\ sites of pTip-LCH2. As a result, a 
plasmid containing p37A gene under control of the TipA-LGlO promoter was created, and the 
resultant plasmid was named pHN291. In addition, amplification by PGR was performed 
using plasmid pHN291 as a template along with primers of SEQ ID Nos. 97 and 25 in the 
Sequence Listing. Consequently, DNA containing p37A gene derived from Drosophila 
melanogaster was obtained. This DNA fragment was digested by restriction enzymes Sail 
and then partially digested by Ncol (so that digestion might not occur at Ncol inside p37A\ 
and the product was subcloned into Ncol and Xhol sites of pBAD/HisA. As a result, a 
plasmid containing p37A gene under control of the arabinose inducible promoter was created, 
and the resultant plasmid was named pHN308. 

Amplification by PGR was performed using plasmid LE59 as a template along with 
primers of SEQ ID Nos. 98 and 99 in the Sequence Listing. Consequently, DNA containing 
a gene encoding Pantothenate kinase 1 beta protein (Rock et al., J. Biol.Ghem. 275 
1377-1383:(2000); GenBank Accession Number AF200357— hereinafter abbreviated as 
PanK) derived from mouse was obtained. Double digestion of this DNA fragment was 
carried out with restriction enzymes Xhol and EcoRl, and the product was subcloned into Xhol 
and ^coRI sites of pBAD/HisA. As a result, a plasmid containing PanK gene under control 
of the arabinose inducible promoter was created, and the resultant plasmid was named 
pHN279. 

Amplification by RT-PGR was performed using mouse liver Poly(A)^RNA along with 
primers of SEQ ID Nos. 100 and 101 in the Sequence Listing. Consequently, DNA 
containing a gene encoding Peroxiredoxin 4 derived from mouse (GenBank Accession 
Number BG019578) was obtained. Double digestion of this DNA fragment was carried out 
with restriction enzymes Xhol and Kpnl^ and the product was subcloned into Xhol and Kpnl 
sites of pBAD/HisA. As a result, a plasmid containing Peroxiredoxin 4 gene under control of 
the arabinose inducible promoter was created, and the resultant plasmid was named pHN278. 
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Amplification by PGR was performed using plasmid LEI 56 as a template along with 
primers of SEQ ID Nos. 102 and 103 in the Sequence Listing. Consequently, DNA 
containing a gene encoding Transferrin-like protein (GenBank Accession Number AK005035; 
hereinafter abbreviated as TFL) derived from mouse was obtained. Double digestion of this 
DNA fragment was carried out with restriction enzymes Xhol and EcoRly and the product was 
subcloned into Xhol and Ecd91 sites of pBAD/HisA. As a result, a plasmid containing TFL 
gene imder control of the arabinose inducible promoter was created, and the resultant plasmid 
was named pHN280. 

All the proteins mentioned above that have signal peptide were subcloned without the 
DNA sequence encoding signal peptide. As for Prothrombin, the DNA sequence encoding 
"Prethrombin-2" immediately before Prothrombin becomes a matured Thrombin (Soejima et 
al., J.Biochem. 130 269-277 (2001)) was subcloned. 

The transformation of Rhodococcus erythropolis JCM3201 strain was carried out in the 
same manner eis in Example 12 using pHN171 (described in Exsimple 7), pHN205, pHN206, 
pHN208, pHN287, pHN289, pHN288, pHN270, pHN271, pHN272, pHN299, pHN284, 
pHN285, pHN286, pHN217, pHN298 and pHN291, respectively to express each protein at 
30**C and 4°C in the similar manner 2is in Example 13. 

All of these proteins had 6xHis tag linked to the end and were purified as in Example 
16- In addition to this, they were purified at this time also from the precipitation (described 
in Example 16) formed by centrifiigation at 20,000xg after cell destruction. The purification 
process from the precipitation was shown below specifically, and the procedure was conducted 
at room temperature. The precipitation was suspended into 1 ml DN-Buffer (50 mM 
Tris-HCl (pH 8.0), 8 M urea) and centrifuged at 20,000xg, and nickel-NTA Superflow which 
was beforehand equilibrated with DT-Buffer was added to 700 \i\ of the supernatant so that the 
bed volume might be 40 ^il. This mixture was subjected to rotation agitation for 1 hour to 
combine the nickel-NTA Superflow beads to the protein attached with 6xHis tag. After 
washing these beads 4 times by NT-Buffer, the protein was eluted from the protein attached 
with 6xHis tag by suspending these beads in 120 \A of DNE-Bufifer (50 mM Tris-HCl (pH 7.0), 
8 M urea, 400 mM imidazole) 3 times. 
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Proteins were expressed in E. coli using pBAD/His/lacZ (Invitrogen), pHN193, 
pHN195, pHN199, pHN276, pHN277, pHN281, pHN273, pHN275, pHN305, pHN212, 
pHN306, pHN308, pHN279, pHN278 and pHN280 following the manufacturer's instructions 
of the pBAD/His kit of Invitrogen. 

The purification process from the precipitation was shown below specifically. The 
cell bodies which expressed a protein were collected and suspended into 1 ml of NT-Buffer. 
The cells were destroyed by using an ultrasonic generator UD-20 (product of TOMY). The 
mixture was centrifiiged at 20,000xg, and nickel-NTA Superflow which was beforehand 
equilibrated with NT-Buffer was added to 900 |il of the supernatant so that the bed volume 
might be 40 This mixture was subjected to rotation agitation for 1 hour to combine the 
nickel-NTA Superflow beads to the protein attached with 6xHis tag. After washing these 
beads 4 times by NT-Buffer, the protein was eluted from the protein attached with 6xHis tag 
by suspending these beads in 120 i^l of NTE-Buffer 3 times. All of the above-mentioned 
procedure were conducted at 4®C. 

Purification was performed also from the precipitation formed by centrifugation at 
20,000xg after cell destruction, the procedure therefore was the same as that of the method 
mentioned above. 

10 ^1 of the above-mentioned samples was subjected to 12% SDS polyacrylamide 
electrophoresis following an ordinary method, and the density of the bands were measured and 
quantified by densitometer and the results are shown in Fig. 21. In this figure, the second 
column from the left shows the name of the protein expressed. The third column from the 
left shows to which of the N-terminal or C-terminal the 6xHis tag was attached. The fourth 
column from the left shows presumed molecular weight (kDa) of the full-length protein 
including signal sequence etc., and the number in parentheses shows presumed molecular 
weight of the actually expressed portion of the protein. The fifth and ninth columns from the 
left show the name of the plasmid used when protein was expressed. The sixth, eighth and 
tenth columns from the left show the mass of the recombinant protein obtained (imit: 
milligram) per liter and the results purified from the supematant fraction (Sup) centrifiiged at 
20,000xg and those purified from the precipitation fraction (Ppt) are separately shown. The 
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symbols + and - in the seventh and eleventh columns from the left show the proliferation rate 
when each transformant was applied on the agar culture medium containing an expression 
induction agent (1 |ag/ml of thiostrepton in the case of Rhodococcus erythropolis and 0.2% 
L-arabinose in the case of E. coli). The transformant proliferated most rapidly is indicated by 
"+-H-", and the transformant which did not proliferate at all is indicated by The host used, 
and the temperature at the time of expression induction are also shown in the topmost part. 
N.D. (Not Detected) shows that the result was below the detection limit. 

The list of each plasmid used in Examples is shown in Table 1 and list of the strains 
used in Examples is shown in Table 2. 
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Table 1 



Category 


Name of Plasmid 


Notes Source 


Comercially available cloning 
vector 


pBluescript II 
SK(+) 


Conventional vector for general cloning 


Stratagene 


pGEM oZ(f)+ 


Conventional vector for general cloning 


Prcmega 


Sources of expression vector 


PRE2895 


Source of RepA£6 

(Cryptic plasmid isolated from R. erythropolis 

JCMZoSo; 


This study 


ptf^136 


Backborne of the expression vector 


This study 


phl4143 


Source of Tnio 


This study 


pM^62 


Source of ALDHp-TipA (Inducer cassette) 


This study 


p(tl153 


Source of TipAp-PIP ORF-ALDHt (Expression cassette) 


This study 


pW169 


Source of Tuflp-Tet 


This study 


Control plasmid 


nHJ172 


Neither Expression cassette nor Indicer cassette 


This study 


nHJ173 


Expression cassette, but no Inducer cassette 


This study 


Expression plasmid 

in R. erythropolis 

(not containing target gene) 


fJ 1 1 }J iM 1 1 


TipAp. 6xHis at N-terminus. MCS typel 


This study 




TipAp. 6xHis at C-termmus, MCS typel 


This study 




TipAp, 6xHis at N-terminus. MCS type2 


This study 




TipAp. GxHis at C-terminus. MCS type2 


This study 




TipAH-GlOp, 6xHis at M-terminus, MCS typel 


This study 


nT i n-l PHI 


TipA-L610p. 6xHis at C-terminus. MCS typel 


This study 


nTirwl KU9 
Pl ipnJwlL 


TipA-LGIOp. 6xHis at N~terminus, MCS type2 


This study 


Pl IPT_14|Z 


TipA-LGlOp, 6xHis at 0-terminus, MCS type2 


This study 


pi ip-linl. 1 


TipAp, 6xHis at N-terminus. MCS type3 


This study 


nTirv-THQ 1 


TipAp. 6xHis at G-terminus. MCS type3 


This study 


nT i rv-l flHI 1 


TipA^.610p, 6xHis at N-terminus. MCS type3 


This study 


nT i rv-l flH9 1 
pi ip^vjrl^. 1 


TipA-LGlOp, 6xHis at 0-terminus. MCS type3 


This study 


Expression plasmid 
in fi erythropolis 
(containing target gene) 


rMMC\ 
IJriii /u 


Target=/'/P in pTip-CHI 


This study 


rM171 


^&rgtt=^PIP in pTip-LCHI 


This study 


|jni 1 / u 


Jarget=AtP/P in pTip-CHI 


This study 




Target=AtP/P m pTip-LCHl 


This study 




Target=fl^ in pTipHHI 


This study 




Target=fl9^ m pTipHJUl 


This study 


|JI II^O^i 


Target=fifr in pTip-NH2 


This study 




Target=fif7^ in pTip-tWC 


This study 




Target=&»/ In pTip-LNHI 


This study 


priMilUQ 


Target=MW/ in pTipHJtll 


This study 


rJ-IJOnQ 


larget=CytoGhrcme b5/ in pTip-LNHI 


This study 




Target=Z£/i3 in pTip-Utfl 


This study 


pni^ov 


Target=rrBnsferr/n in pTip4Jiil 


This study 


|Xli^O o 


Target=4Dcia5' in pTip-IJH2 


This study 




^axgiSit-Cathepsin D in pTipH_CH1 


This study 


pi / 1 


1 ax^jsst—rrotnronvin i n p 1 1 p-Lull 


This study 


pHC72 


larget-KallikreinS in pTip-LCHI 


This study 


pm299 


Target=ZiZ»lb5e in pTip-UHI 


This study 


pm284 


Target=ZL4^ in pTip-LMC 


This study 


pW285 


Target=^M^/ in pTip-LMC 


This study 


PHN286 


Target=A7W in pTip-iNH2 


This study 


pm217 


Target=£tor alpha in pTip-UHl 


This study 


pm298 


^arget-Glucda'nase in pTipHJtll 


This study 


P»I291 


Target=ia?Z4 in pTip-LCH2 


This study 
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Category 


Nane of Plasmid 


Notes 


Source 


Expression plasmid in Ecoli 
(not containing target gene) 


pBAD/HisA 


BAD promoter. 6xHis at W-terminus. Xpress Epitope, MCS 


Invitrogen 


pBAD-Llnker 


BAD pranoter, for library construction 


This study 


Expression plasmid In Ecoli 
(containing target gene) 


p»i193 


Targe!t=&»/ in pBAD/HisA 


This study 


pfM195 


Target=/l««W^ in pBAD/HisA 


This study 


p»4199 


'\wg<iit^Cytodirane bSI in pBAD/HisA 


This study 


pm276 


Target=Z£/i!? in pBAD/HisA 


This study 


pm277 


^arget^Trmsferrin in pBAD/HlsA 


This study 


PH\I281 


Target=4oaa5 in pBAD/HisA 


This study 


pm273 


Jzre.eit=Caif]eosin D in oBAD/HisA 


Th i e e-f-i jHv 
1 1 1 1 o o uju jr 




1 argeL— Ao/ / IfKiGIlK} \ \\ PDnLI/rl 1 SA 


1 Im I A #«^l MMfc J 

in IS SLUoy 


pm305 


Target=/^/ in pBAD/«isA 


This study 




^^x&\^Bax alpha in pBAD/HisA 


This study 




^9x&i^Gluookina^ in pBAD/HisA 


This study 


P»fi308 


Target=A?Z4 in pBAD/HisA 


This study 


pm279 


Target=/'^ in pBAD/HisA 


This study 


pm278 


^zr^^^^Peroxi redox in4 in pBAD/HlsA 


This study 


pm280 


Target=77Zin pBAD/HisA 


This study 


pBAD/His/lacZ 


Target=/a5C!Z' 


Invitrogen 
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All the publications, patent, and patent application cited in this specification are entirely 
incorporated as references. 

Industrial Applicability 

As shown in Examples 13 to 18 and Example 20, expression and production of a 
protein encoded by a foreign gene are enabled under low temperature conditions of 4^*0 by 
using the expression vector of the present invention. 

Free words in the Sequence Listing 
Sequences 1 to 105: primers 
Sequences 106 to 1 13: vectors 
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CLAIMS 

1. An expression vector which can inducibly express a foreign gene with an inducing 
substance in a host cell and can be expressed at a temperature equal to or below the suitable 
growth temperature range of a host other than said host. 

2. An expression vector which can inducibly express a foreign gene with an inducing 
substance in a host cell and can be expressed at a temperature of 15°C or lower. 

3. The expression vector according to claim 1 or 2 which can be expressed at 4°C. 

4. The expression vector according to any one of claims 1 to 3 wherein the host cell is a 
bacterium of genus Rhodococcus. 

5. The expression vector according to claim 4 wherein the bacterium of genus 
Rhodococcus is selected from the group consisting of R.erythropolis, R,fascians and R,opacus. 

6. The expression vector according to any one of claims 1 to 5 wherein the inducing 
substance is thiostrepton. 

7. The expression vector according to any one of claims 1 to 6 wherein the foreign gene 
encodes a protein which inhibits proliferation of the host cell imder medium to high 
temperature conditions exceeding 1 5^C. 

8. The expression vector according to any one of claims 1 to 7 wherein the vector 
comprises a promoter sequence whose expression can be regulated by an inducing substance 
and a multicloning site to which a foreign gene can be introduced. 

9. A transformant comprising an expression vector according to any one of claims 1 to 8. 

10. A method for producing protein using an expression vector according to any one of 
claims 1 to 8. 

1 1 . An inducible expression vector which can inducibly express a gene encoding a protein 
which inhibits proliferation of the host cell when it is expressed at a temperature within the 
range of suitable growth temperature of the host cell in another host cell having a suitable 
growth temperature range lower than the suitable growth temperature of said host cell using an 
inducing substance. 

12. The expression vector according to claim 1 1 which can be expressed at 4**C. 
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13. The expression vector according to claim 1 1 or 12 wherein the host cell is a bacterium 
of genus Rhodococcus. 

14. The expression vector according to claim 13 wherein the bacterium of genus 
Rhodococcus is selected from the group consisting of R.erythropolis, RJascians and Ropacus. 

15. The expression vector according to any one of claims 11 to 14 wherein the inducing 
substance is thiostrepton. 

16. The expression vector according to any one of claims 11 to 15 wherein the vector 
comprises a promoter sequence whose expression can be regulated by an inducing substance 
and a multicloning site to which a foreign gene can be introduced. 

17. A transformant comprising an expression vector according to any one of claims 11 to 
16. 

18. A method for producing a protein using an expression vector according to any one of 
claims 11 to 16. 

19. An expression vector which can inducibly express a foreign gene using an inducing 
substance in a bacterium of genus Rhodococcus. 

20. The expression vector according to claim 19 wherein the bacterium of genus 
Rhodococcus is selected from the group consisting of R.erythropolis, RJascians and R,opacus, 

21. The expression vector according to claim 19 or 20 wherein the inducing substance is 
thiostrepton. 

22. An expression vector according to any one of claims 19 to 21 wherein the vector 
comprises an expression cassette comprising a TipA gene promoter sequence, the first 
multicloning site to which a foreign gene can be introduced and a transcription termination 
sequence, an inducer cassette comprising the second promoter sequence and TipA gene, a 
DNA region essential to autonomous replication of a plasmid for a bacterium of genus 
Rhodococcus and a thiostrepton resistance gene. 

23. A transformant of a bacterium of genus Rhodococcus comprising an expression vector 
according to any one of claims 19 to 22. 

24. A method for producing a protein using an expression vector according to any one of 
claims 19 to 22. 
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25. An inducible expression vector for a bacterium of genus Rhodococcus which can 
inducibly express a gene encoding a protein whose expression product inhibits proliferation of 
the host cell under medium to high temperature conditions exceeding 15^C in a bacterium of 
genus Rhodococcus which can proliferate under low-temperature conditions. 

26. An inducible expression vector for a bacterium of genus Rhodococcus whose 
expression can be induced in a bacterium of genus Rhodococcus in which a foreign gene can 
be multiplied under low-temperature conditions, wherein the vector comprises an expression 
cassette comprising a TipA gene promoter sequence, the first multicloning site to which a 
foreign gene can be introduced and a transcription termination sequence, an inducer cassette 
comprising the second promoter sequence and TipA gene, a DNA region essential to 
autonomous replication of a plasmid for a bacterium of genus Rhodococcus and a thiostrepton 
resistance gene. 

27. The inducible expression vector for a bacterium of genus Rhodococcus according to 
claim 26 wherein the vector further comprises a DNA region essential to autonomous 
replication of a plasmid for E. coli and can be reproduced in E, coli. 

28. The inducible expression vector for a bacterium of genus Rhodococcus according to 
claim 26 or 27 wherein the TipA gene promoter is TipA-LGlO promoter. 

29. The inducible expression vector for a bacterium of genus Rhodococcus according to 
any one of claims 26 to 28 wherein the vector is selected from the group consisting of 
pTip-NHl having a nucleotide sequence represented by SEQ ID No. 106, pTip-NH2 having a 
nucleotide sequence represented by SEQ ID No. 107, pTip-CHl having a nucleotide sequence 
represented by SEQ ID No. 108, pTip-CH2 having a nucleotide sequence represented by SEQ 
ID No. 109, pTip-LNHl having a nucleotide sequence represented by SEQ ID No. 110, 
pTip-LNH2 having a nucleotide sequence represented by SEQ ID No. Ill, pTip-LCHl having 
a nucleotide sequence represented by SEQ ID No. 112, pTip-LCH2 having a nucleotide 
sequence represented by SEQ ID No. 113, pTip-CHl.l, pTip-CH2.1, pTip-LCHl.l and 
pTip-LCH2.1. 
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30. The inducible expression vector for a bacterium of genus Rhodococcus according to 
any one of claims 25 to 29 wherein the bacterium of genus Rhodococcus is selected from the 
group consisting of R.erythropolis, R.fascians £ind R.opacus, 

31. A transformant of a bacterium of genus Rhodococcus comprising an inducible 
expression vector for a bacterium of genus Rhodococcus according to any one of cleiims 25 to 
30. 

32. A process for producing a protein whose expression product inhibits proliferation of the 
host cell under medixim to high temperature conditions exceeding 15°C wherein the process 
comprises introducing an inducible expression vector for a bacterium of genus Rhodococcus 
according to any one of claims 25 to 30 comprising as a foreign gene a gene encoding a 
protein difiScult to be expressed under medixrai to high temperature conditions exceeding 15®C 
into a bacterium of genus Rhodococcus which can proliferate at low temperature; and 
culturing the above-mentioned bacteria into which an inducible expression vector for a 
bacterium of genus Rhodococcus is introduced using a culture medium comprising 
thiostrepton under low-temperature. 

33. The process for producing a protein at low temperature according to claim 32 wherein 
the protein difficult to be expressed xmder medium to high temperature conditions exceeding 
15°C is a protein which inhibits proliferation of the host cell under medium to high 
temperature conditions exceeding 15°C. 

34. The process for producing a protein at low temperature according to claim 32 wherein 
the protein difficult to be expressed under medium to high temperature conditions exceeding 
15''C is a protein which forms an inactive inclusion body in E. coli when it is made expressed 
xmder medium to high temperature conditions exceeding 15®C. 

35. A process for producing a protein derived from psychrotrophic bacteria, heterothermic 
animals or plants living under a low-temperature environment wherein the process comprises 
introducing an inducible expression vector for a bacterium of genus Rhodococcus according to 
any one of claims 25 to 30 comprising a gene encoding a protein derived from psychrotrophic 
bacteria, heterothermic animals or plants living under a low-temperature environment into a 
bacterium of genus Rhodococcus which can proliferate at low temperature; and culturing the 
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above-mentioned bacteria into which an inducible expression vector for a bacterium of genus 
Rhodococcus is introduced using a culture medium comprising thiostrepton under 
low-temperature. 

36. A process for screening a protein difficult to be expressed under medium to high 
temperature conditions exceeding 15®C wherein the process comprises introducing an 
inducible expression vector for a bacterium of genus Rhodococcus according to any one of 
claims 25 to 30 comprising a foreign gene into a bacterium of genus Rhodococcus which can 
proliferate at low temperature; culturing the above-mentioned bacteria into which an inducible 
expression vector for a bacterium of genus Rhodococcus is introduced using a culture medium 
comprising thiostrepton under medium to high temperature conditions exceeding IS^C and 
under low-temperature condition; and selecting a gene which can be expressed only under 
low-temperature condition equal to or below 15®C. 

37. The process for screening a protein difficult to be expressed under medium to high 
temperature conditions exceeding IS'^C according to claim 36 wherein the protein difficult to 
be expressed imder medium to high temperature conditions exceeding 15®C is a protein which 
inhibits proliferation of the host cell under medium to high temperature conditions exceeding 
15°C. 

38. A process for screening a protein difficult to be expressed under medium to high 
temperature conditions exceeding 15°C wherein the process comprises selecting a gene which 
does not express or inhibits proliferation of E. coli when it is introduced and made expressed 
under medium to high temperature conditions exceeding 15®C; and then introducing an 
inducible expression vector for a bacterium of genus Rhodococcus according to any one of 
claims 25 to 30 comprising said gene as a foreign gene into a bacterium of genus Rhodococcus 
which can proliferate at low temperature; and selecting a gene which can be expressed when 
cultured under low-temperature condition using a culture medium comprising thiostrepton. 

39. The process for screening a protein according to claim 38 wherein the protein difficult 
to be expressed under medium to high temperatvire conditions exceeding 15°C is a protein 
which inhibits proliferation of E, coli at 30*^0 or higher. 



67 



40. The process for screening a protein according to claim 38 wherein the protein difficult 
to be expressed under medium to high temperature conditions exceeding 15°C is a protein 
which forms an inclusion body when it is made expressed under medium to high temperature 
conditions exceeding 15°C. 

41. The process for screening a protein according to claim 38 wherein the protein difficult 
to be expressed under medium to high temperature conditions exceeding 15°C is a protein 
which inhibits proliferation of the host cell imder medium to high temperature conditions 
exceeding 15°C 

42. A protein difficult to be expressed under medium to high temperature conditions 
exceeding 15°C obtained by a screening process according to any one of claims 36 to 41. 
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ABSTRACT 



An inducible expression vector which can express a protein at low temperature and a 
method for expressing a protein at low temperature using this vector are provided. The 
inducible expression vector can express a gene encoding a protein whose expression product 
inhibits proliferation of the host cell under medium to high temperature conditions exceeding 
about 1 5°C in a host cell which can proliferate under low temperature conditions. 
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<110> National Institute of Advanced Industrial Science and Technology 

<120> A novel expression vector suitable for the expression at 1 
temperature 

<130> 331H01018 

<140> 
<141> 

<160> 113 

<170> Patentin Ver. 2. 1 

<210> 1 
<211> 25 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHNl 
<400> 1 

cagagctcgt caggtggcac ttttc 25 

<210> 2 
<211> 30 

1/81 



<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHN2 

<400> 2 

gttgtacaac tagtcgtgcc agctgcatta 

<210> 3 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer sHN120 
<400> 3 

gctgtacacc cgagaagctc ccagcg 
<210> 4 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN121 
<400> 4 

cggagctctt gaacgagagt tggccgttg 

2/81 



<210> 5 
<211> 39 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer sHN122 
<400> 5 

tcagatctat cgtcatcgac tgcgatcacg ttgacgccg 

<210> 6 
<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN123 
<400> 6 

acggatcctc cgctgaaatc tcgccgtgcc t 

<210> 7 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHNlSO 

3/81 



<400> 7 

cttcatatgc ggagctcgac cgcgcggg 

<210> 8 
<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHNlSl 
<400> 8 

atcgagtcgt tcaagggcgt cggc 
<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer NE61233 
<400> 9 

agcggataac aatttcacac agg 

<210> 10 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial 
<400> 10 

caccaggatg atccccgac 

<210> 11 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 11 

gacagtgaca tcaccagc 

<210> 12 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 12 

cgccagggtt ttcccagtca cgac 



Sequence: primer sHNlO 



19 



Sequence: primer sHNll 



18 



Sequence: primer NEB1224 



24 
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<210> 13 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN40 
<400> 13 

atgagctact ccgtgggaca ggtg 

<210> 14 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN41 
<400> 14 

tgcagatctt ccgtttcgac gtgacggag 

<210> 15 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN42 
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<400> 15 

cagtctagaa ttgatctcct cgaccg 



26 



<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHN43 
<400> 16 

tgcaagctcc tatgtaaacg 20 



<210> 17 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN55 
<400> 17 

cgcctgctcc acggccgcc 19 

<210> 18 
<211> 18 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN56 
<400> 18 

atggaggcac gcagcatg 

<210> 19 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN57 
<400> 19 

cgccccctcg gagtcggcg 

<210> 20 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN58 
<400> 20 

atggacgccg ccgaggac 
<210> 21 
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<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN147 
<400> 21 

cgtgtacata tcgaggcggg ctccca 

<210> 22 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN39 

<400> 22 

atccatggcc gctcccttct ctgacgccgt c 

<210> 23 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN36 
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<400> 23 

accatggatc aggaatgcat ag 



22 



<210> 24 
<211> 59 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHN37 



<400> 24 

ttactagttt attaatgatg atgatgatga tgcaggtgtt tcaggatgaa atccgaaag 59 



<210> 25 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHN6 
<400> 25 

cgtctagagt cccgctgagg cggcgtagc 29 

<210> 26 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer sHN9 
<400> 26 

ctactagtcg acccaccggc acccgtgag 

<210> 27 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN141 

<400> 27 

aatctagagt aacgggctac tccgtttaac 

<210> 28 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : primer sHN142 
<400> 28 

gggtcgacgg tcctcctgtg gagtggttct 
<210> 29 
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<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN145 
<400> 29 

gcactcgaga tgaaatctaa caatgcgctc ate 

<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN152 

<400> 30 

agactagtcc tcaacgacag gagcacgatc 

<210> 31 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer T7 



12/81 



<400> 31 

gtaatacgac tcactatagg gc 



<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN153 
<400> 32 

aatccacagg acgggtgtgg 
<210> 33 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN154 
<400> 33 

ctctacgccg gacgcatcg 

<210> 34 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer T3 
<400> 34 

gcaattaacc ctcactaaag gg 

<210> 35 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN155 
<400> 35 

acgacgctct cccttatgcg 

<210> 36 
<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN156 
<400> 36 

ccgatgccct tgagagcct 



<210> 37 
<211> 67 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHNllO 

<400> 37 

aaccatggta tatctccttc ttaaagttaa acaaaattat ttctagacgc cgtccacgct 60 
gcctcct 67 

<210> 38 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer NNcol 
<400> 38 

catgggccac catcaccatc accatatggg aattctacgt agcggccgcg gatccaagct 60 
tagatctcga ggatgaa 77 

<210> 39 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer NNco2 
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<400> 39 

ctagttcatc ctcgagatct aagcttggat ccgcggccgc tacgtagaat tcccatatgg 60 
tgatggtgat ggtggcc 77 
<210> 40 
<211> 71 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer CNcol 
<400> 40 

catgggaatt ctacgtagcg gccgcggatc caagcttaga tctcgaggac atcaccatca 60 
ccatcactga a 71 

<210> 41 
<211> 71 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer CNco2 
<400> 41 

ctagttcagt gatggtgatg gtgatgtcct cgagatctaa gcttggatcc gcggccgcta 60 
cgtagaattc c 71 



<210> 42 
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<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHN159 
<400> 42 

tccatatgcg ctcccttctc tgacgccgt 29 

<210> 43 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer NNdel 
<400> 43 

tatgggccat caccatcacc atcacgccat gggaattcta cgtagcggcc gcggatccaa 60 
gcttagatct cgaggatgaa 80 

<210> 44 

<211> 82 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer NNde2 
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<400> 44 

ctagttcatc ctcgagatct aagcttggat ccgcggccgc tacgtagaat tcccatggcg 60 
tgatggtgat ggtgatggcc ca 82 



<210> 45 
<211> 71 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer CNdel 
<400> 45 

tatgggaatt ctacgtagcg gccgcggatc caagcttaga tctcgaggac atcaccatca 60 
ccatcactga a 71 

<210> 46 
<211> 73 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer CNde2 
<400> 46 

ctagttcagt gatggtgatg gtgatgtcct cgagatctaa gcttggatcc gcggccgcta 60 
cgtagaattc cca 73 
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<210> 47 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN160 
<400> 47 

aacatatgta tatctccttc ttaaagttaa ac 

<210> 48 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN97 
<400> 48 

ataccatgga acctcatgaa gc 

<210> 49 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN98 
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<400> 49 

aactcgagat cccataagtg ctttcatctt 

<210> 50 
<211> 27 

<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: primer sHN82 
<400> 50 

tactcatgat gcatcaccat caccatc 27 

<210> 51 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
pTrc99A-Cseq 

<400> 51 

cagaccgctt ctgcgttctg 20 



<210> 52 
<211> 27 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN272 
<400> 52 

atccatggcc cctatactag gttattg 

<210> 53 

<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN271 
<400> 53 

aactcgagtc aatccgattt tggaggatgg teg 
<210> 54 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN150 



<400> 54 

catgggaatt cagatctctc gaga 
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<210> 55 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN212 
<400> 55 

agcttctcga gagatctgaa ttcc 24 

<210> 56 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
pBAD (Forward) 

<400> 56 

ctatgccata gcatttttat cc 22 

<210> 57 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer sHN166 
<400> 57 

gccatatggg gtttttttca tttgttcacg 

<210> 58 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN167 
<400> 58 

aactcgagtc agtatttgtc aggcagtcc 

<210> 59 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN168 
<400> 59 

gcctcgaggg gtttttttca tttgttcacg 



<210> 60 
<211> 29 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN169 

<400> 60 

gaggtacctc agtatttgtc aggcagtcc 
<210> 61 
<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN170 
<400> 61 

cacatatgct ccgccagatc ctcgg 

<210> 62 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN171 
<400> 62 

ttgaattctt agaagtctgg gccttctttc 
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<210> 63 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN172 
<400> 63 

ccctcgagat gctccgccag atcctcgg 

<210> 64 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN177 
<400> 64 

cccatatggc cgggcagtca gacaag 

<210> 65 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN178 
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<400> 65 

gagaattctc aatcttctgc catgtagagg 
<210> 66 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN179 
<400> 66 

aactcgagat ggccgggcag tcagacaag 

<210> 67 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN290 
<400> 67 

aacatatgaa caagagctct gaagatatcc 

<210>'68 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

26/81 



<220> 

<223> Description of Artificial Sequence: primer sHN262 
<400> 68 

atgaattcat ggcaaccatc taactg 

<210> 69 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN261 
<400> 69 

ttctcgagaa caagagctct gaagatatcc g 

<210> 70 
<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN295 
<400> 70 

aacatatggc tgtccctgac aaaacggtc 



<210> 71 
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<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN266 
<400> 71 

ctaagctttt aatgtttgtg gaaagtgc 

<210> 72 
<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN265 
<400> 72 

gtctcgaggt ccctgacaaa acggtcaaat g 

<210> 73 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence .'primer sHN288 
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<400> 73 

ttccatggca cggaagagcc tctggg 

<210> 74 
<211> 27 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence :primersHN268 
<400> 74 

ttgaattcca gacaatgagc tggaggg 

<210> 75 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN267 
<400> 75 

aactcgagcg gaagagcctc tgggactac 

<210> 76 
<211> 31 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer sHN243 

<400> 76 

ctccatgggg attatcagaa tccctctgcg c 

<210> 77 

<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN244 
<400> 77 

agctcgagag agtacgacag cattggcaaa gcc 
<210> 78 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN245 
<400> 78 

aaccatgggc accaccgatg cggagttcca cacc 



<210> 79 
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<211> 40 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN246 
<400> 79 

aactcgagat ccaaattgat caatgacttt ctgtatccac 

<210> 80 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN247 
<400> 80 

aaccatggga attgttggag gatttaactg tgag 

<210> 81 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN248 



<400> 81 
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aactcgagag tcattttcag ccatagtttc tcttatcc 



38 



<210> 82 
<211> 35 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer sHN275 
<400> 82 

aacatatgct gaggctctgc tccttcaatg tgagg 35 



<210> 83 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer sHN307 
<400> 83 

ttctcgagcg tgatacctag gagcg 25 



<210> 84 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer sHN277 
<400> 84 

ggccatgggg acaccagaaa tctcatgc 

<210> 85 
<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN278 
<4(?0> 85 

aagaattcac cgagtttact tacagaaccc 

<210> 86 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN279R 
<400> 86 

aaccatgggc aaaggagatc ctaagaag 



<210> 87 
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<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN280 

<400> 87 

ttgaattcct gcgctagaac caacttattc ate 

<210> 88 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN314 
<400> 88 

aactcgaggg caaaggagat cctaagaag 

<210> 89 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN283 



<400> 89 
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gaccatggct cctgagcaat gggaag 
<210> 90 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer sHN284 
<400> 90 

ataagctttt aagggtcctc atccacgtga a 

<210> 91 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN164 

<400> 91 

aacatatgga cgggtccggg gagcag 

<210> 92 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN194 
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<400> 92 

aagaattctc agcccatctt cttccagatg 

<210> 93 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN193 
<400> 93 

aactcgagat ggacgggtcc ggggagca 

<210> 94 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN305 
<400> 94 

ctcatatggc tgtggatact acaagg 
<210> 95 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer sHN306 

<400> 95 

atctcgagga tttcactggc ccagcatgc 

<210> 96 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN296 
<400> 96 

aactcgagcg tcggtatcct ttttgcgctg 

<210> 97 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN330 
<400> 97 

acccatgggc gacggtgctg gaaattg 
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<210> 98 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN259 
<400> 98 

aactcgagat gaagcttgta aatggcagaa ag 

<210> 99 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN260 
<400> 99 

aagaattcct ctactgtgta tcggtcat 

<210> 100 
<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer sHN269 
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<400> 100 

aactcgagct gcaaggcttg gagagtgatg 

<210> 101 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN270 
<400> 101 

gaggtacctt tcagtttagc ttgtcgaaat ac 

<210> 102 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence .'primer sHN263 
<400> 102 

gcctcgagct tcctgagaag accatacgat g 

<210> 103 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: primer sHN264 
<400> 103 

ctgaattctg tttaatattt atgaaatgtg 

<210> 104 
<211> 30 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer sHN343 

0 

<400> 104 

aaactagttc agtgatggtg atggtgatgc tcgagagatc t 

<210> 105 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer T7 Universal 
<400> 105 

taatacgact cactataggg 



40/81 



<210> 106 
<211> 8166 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector pTip-NHl 
<400> 106 

gagctcgacc gcgcgggtcc cggacgggga agagcgggga gctttgccag agagcgacga 60 

cttccccttg cgttggtgat tgccggtcag ggcagccatc cgccatcgtc gcgtagggtg 120 

tcacacccca ggaatcgcgt cactgaacac agcagccggt aggacgacca tgactgagtt 180 

ggacaccatc gcaaatccgt ccgatcccgc ggtgcagcgg atcatcgatg tcaccaagcc 240 

gtcacgatcc aacataaaga caacgttgat cgaggacgtc gagcccctca tgcacagcat 300 

cgcggccggg gtggagttca tcgaggtcta cggcagcgac agcagtcctt ttccatctga 360 

gttgctggat ctgtgcgggc ggcagaacat accggtccgc ctcatcgact cctcgatcgt 420 

caaccagttg ttcaaggggg agcggaaggc caagacattc ggcatcgccc gcgtccctcg 480 

cccggccagg ttcggcgata tcgcgagccg gcgtggggac gtcgtcgttc tcgacggggt 540 

gaagatcgtc gggaacatcg gcgcgatagt acgcacgtcg ctcgcgctcg gagcgtcggg 600 

gatcatcctg gtggacagtg acatcaccag catcgcggac cggcgtctcc aaagggccag 660 

ccgaggttac gtcttctccc ttcccgtcgt tctctccggt cgcgaggagg ccatcgcctt 720 

cattcgggac agcggtatgc agctgatgac gctcaaggcg gatggcgaca tttccgtgaa 780 

ggaactcggg gacaatccgg atcggctggc cttgctgttc ggcagcgaaa agggtgggcc 840 

ttccgacctg ttcgaggagg cgtcttccgc ctcggtttcc atccccatga tgagccagac 900 

cgagtctctc aacgtttccg tttccctcgg aatcgcgctg cacgagagga tcgacaggaa 960 

tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 1020 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 
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tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 
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tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 
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ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc atcctcgaga tctaagcttg gatccgcggc cgctacgtag aattcccata 6060 

tggtgatggt gatggtggcc catggccgct cccttctctg acgccgtcca cgctgcctcc 6120 

tcacgtgacg tgaggtgcaa gcccggacgt tccgcgtgcc acgccgtgag ccgccgcgtg 6180 

ccgtcggctc cctcagcccg ggcggccgtg ggagcccgcc tcgatatgta cacccgagaa 6240 

gctcccagcg tcctcctggg ccgcgatact cgaccaccac gcacgcacac cgcactaacg 6300 

attcggccgg cgctcgattc ggccggcgct cgattcggcc ggcgctcgat tcggccggcg 6360 
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ctcgattcgg ccggcgctcg attcggccga gcagaagagt gaacaaccac cgaccacgct 6420 

tccgctctgc gcgccgtacc cgacctacct cccgcagctc gaagcagctc ccgggagtac 6480 

cgccgtactc acccgcctgt gctcaccatc caccgacgca aagcccaacc cgagcacacc 6540 

tcttgcacca aggtgccgac cgtggctttc cgctcgcagg gttccagaag aaatcgaacg 6600 

atccagcgcg gcaaggttca aaaagcaggg gttggtgggg aggaggtttt ggggggtgtc 6660 

gccgggatac ctgatatggc tttgttttgc gtagtcgaat aattttccat atagcctcgg 6720 

cgcgtcggac tcgaatagtt gatgtgggcg ggcacagttg ccccatgaaa tccgcaacgg 6780 

ggggcgtgct gagcgatcgg caatgggcgg atgcggtgtt gcttccgcac cggccgttcg 6840 

cgacgaacaa cctccaacga ggtcagtacc ggatgagccg cgacgacgca ttggcaatgc 6900 

ggtacgtcga gcattcaccg cacgcgttgc tcggatctat cgtcatcgac tgcgatcacg 6960 

ttgacgccgc gatgcgcgca ttcgagcaac catccgacca tccggcgccg aactgggtcg 7020 

cacaatcgcc gtccggccgc gcacacatcg gatggtggct cggccccaac cacgtgtgcc 7080 

gcaccgacag cgcccgactg acgccactgc gctacgccca ccgcatcgaa accggcctca 7140 

agatcagcgt cggcggcgat ttcgcgtatg gcgggcaact gaccaaaaac ccgattcacc 7200 

ccgattggga gacgatctac ggcccggcca ccccgtacac attgcggcag ctggccacca 7260 

tccacacacc ccggcagatg ccgcgtcggc ccgatcgggc cgtgggcctg ggccgcaacg 7320 

tcaccatgtt cgacgccacc cggcgatggg catacccgca gtggtggcaa caccgaaacg 7380 

gaaccggccg cgactgggac catctcgtcc tgcagcactg ccacgccgtc aacaccgagt 7440 

tcacgacacc actgccgttc accgaagtac gcgccaccgc gcaatccatc tccaaatgga 7500 

tctggcgcaa tttcaccgaa gaacagtacc gagcccgaca agcgcatctc ggtcaaaaag 7560 

gcggcaaggc aacgacactc gccaaacaag aagccgtccg aaacaatgca agaaagtacg 7620 

acgaacatac gatgcgagag gcgattatct gatgggcgga gccaaaaatc cggtgcgccg 7680 

aaagatgacg gcagcagcag cagccgaaaa attcggtgcc tccactcgca caatccaacg 7740 

cttgtttgct gagccgcgtg acgattacct cggccgtgcg aaagctcgcc gtgacaaagc 7800 

tgtcgagctg cggaagcagg ggttgaagta ccgggaaatc gccgaagcga tggaactctc 7860 

gaccgggatc gtcggccgat tactgcacga cgcccgcagg cacggcgaga tttcagcgga 7920 

ggatctgtcg gcgtaaccaa gtcagcgggt tgtcgggttc cggccggcgc tcggcactcg 7980 

gaccggccgg cggatggtgt tctgcctctg gcgcagcgtc agctaccgcc gaaggcctgt 8040 

catcgaccgg cttcgactga agtatgagca acgtcacagc ctgtgattgg atgatccgct 8100 
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cacgctcgac cgctacctgt tcagctgccg cccgctgggc atgagcaacg gccaactctc 8160 
gttcaa gjgg 



<210> 107 
<211> 8169 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: vector pTip-NH2 
<400> 107 

gagctcgacc gcgcgggtcc cggacgggga agagcgggga gctttgccag agagcgacga 60 

cttccccttg cgttggtgat tgccggtcag ggcagccatc cgccatcgtc gcgtagggtg 120 

tcacacccca ggaatcgcgt cactgaacac agcagccggt aggacgacca tgactgagtt 180 

ggacaccatc gcaaatccgt ccgatcccgc ggtgcagcgg atcatcgatg tcaccaagcc 240 

gtcacgatcc aacataaaga caacgttgat cgaggacgtc gagcccctca tgcacagcat 300 

cgcggccggg gtggagttca tcgaggtcta cggcagcgac agcagtcctt ttccatctga 360 

gttgctggat ctgtgcgggc ggcagaacat accggtccgc ctcatcgact cctcgatcgt 420 

caaccagttg ttcaaggggg agcggaaggc caagacattc ggcatcgccc gcgtccctcg 480 

cccggccagg ttcggcgata tcgcgagccg gcgtggggac gtcgtcgttc tcgacggggt 540 

gaagatcgtc gggaacatcg gcgcgatagt acgcacgtcg ctcgcgctcg gagcgtcggg 600 

gatcatcctg gtggacagtg acatcaccag catcgcggac cggcgtctcc aaagggccag 660 

ccgaggttac gtcttctccc ttcccgtcgt tctctccggt cgcgaggagg ccatcgcctt 720 

cattcgggac agcggtatgc agctgatgac gctcaaggcg gatggcgaca tttccgtgaa 780 

ggaactcggg gacaatccgg atcggctggc cttgctgttc ggcagcgaaa agggtgggcc 840 

ttccgacctg ttcgaggagg cgtcttccgc ctcggtttcc atccccatga tgagccagac 900 

cgagtctctc aacgtttccg tttccctcgg aatcgcgctg cacgagagga tcgacaggaa 960 

tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 1020 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 
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gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 
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atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360. 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 
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taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc atcctcgaga tctaagcttg gatccgcggc cgctacgtag aattcccatg 6060 

gcgtgatggt gatggtgatg gcccatatgc gctcccttct ctgacgccgt ccacgctgcc 6120 

tcctcacgtg acgtgaggtg caagcccgga cgttccgcgt gccacgccgt gagccgccgc 6180 

gtgccgtcgg ctccctcagc ccgggcggcc gtgggagccc gcctcgatat gtacacccga 6240 

gaagctccca gcgtcctcct gggccgcgat actcgaccac cacgcacgca caccgcacta 6300 
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acgattcggc cggcgctcga ttcggccggc gctcgattcg gccggcgctc gattcggccg 6360 

gcgctcgatt cggccggcgc tcgattcggc cgagcagaag agtgaacaac caccgaccac 6420 

gcttccgctc tgcgcgccgt acccgaccta cctcccgcag ctcgaagcag ctcccgggag 6480 

taccgccgta ctcacccgcc tgtgctcacc atccaccgac gcaaagccca acccgagcac 6540 

acctcttgca ccaaggtgcc gaccgtggct ttccgctcgc agggttccag aagaaatcga 6600 

acgatccagc gcggcaaggt tcaaaaagca ggggttggtg gggaggaggt tttggggggt 6660 

gtcgccggga tacctgatat ggctttgttt tgcgtagtcg aataattttc catatagcct 6720 

cggcgcgtcg gactcgaata gttgatgtgg gcgggcacag ttgccccatg aaatccgcaa 6780 

cggggggcgt gctgagcgat cggcaatggg cggatgcggt gttgcttccg caccggccgt 6840 

tcgcgacgaa caacctccaa cgaggtcagt accggatgag ccgcgacgac gcattggcaa 6900 

tgcggtacgt cgagcattca ccgcacgcgt tgctcggatc tatcgtcatc gactgcgatc 6960 

acgttgacgc cgcgatgcgc gcattcgagc aaccatccga ccatccggcg ccgaactggg 7020 

tcgcacaatc gccgtccggc cgcgcacaca tcggatggtg gctcggcccc aaccacgtgt 7080 

gccgcaccga cagcgcccga ctgacgccac tgcgctacgc ccaccgcatc gaaaccggcc 7140 

tcaagatcag cgtcggcggc gatttcgcgt atggcgggca actgaccaaa aacccgattc 7200 

accccgattg ggagacgatc tacggcccgg ccaccccgta cacattgcgg cagctggcca 7260 

ccatccacac accccggcag atgccgcgtc ggcccgatcg ggccgtgggc ctgggccgca 7320 

acgtcaccat gttcgacgcc acccggcgat gggcataccc gcagtggtgg caacaccgaa 7380 

acggaaccgg ccgcgactgg gaccatctcg tcctgcagca ctgccacgcc gtcaacaccg 7440 

agttcacgac accactgccg ttcaccgaag tacgcgccac cgcgcaatcc atctccaaat 7500 

ggatctggcg caatttcacc gaagaacagt accgagcccg acaagcgcat ctcggtcaaa 7560 

aaggcggcaa ggcaacgaca ctcgccaaac aagaagccgt ccgaaacaat gcaagaaagt 7620 

acgacgaaca tacgatgcga gaggcgatta tctgatgggc ggagccaaaa atccggtgcg 7680 

ccgaaagatg acggcagcag cagcagccga aaaattcggt gcctccactc gcacaatcca 7740 

acgcttgttt gctgagccgc gtgacgatta cctcggccgt gcgaaagctc gccgtgacaa 7800 

agctgtcgag ctgcggaagc aggggttgaa gtaccgggaa atcgccgaag cgatggaact 7860 

ctcgaccggg atcgtcggcc gattactgca cgacgcccgc aggcacggcg agatttcagc 7920 

ggaggatctg tcggcgtaac caagtcagcg ggttgtcggg ttccggccgg cgctcggcac 7980 

tcggaccggc cggcggatgg tgttctgcct ctggcgcagc gtcagctacc gccgaaggcc 8040 
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tgtcatcgac cggcttcgac tgaagtatga gcaacgtcac agcctgtgat tggatgatcc 8100 
gctcacgctc gaccgctacc tgttcagctg ccgcccgctg ggcatgagca acggccaact 8160 
ctcgttcaa 8169 

<210> 108 
<211> 8160 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector pTip-CHl 



<400> 108 



gagctcgacc 


gcgcgggtcc 


cggacgggga 


agagcgggga 


gctttgccag 


agagcgacga 


I 60 


cttccccttg 


cgttggtgat 


tgccggtcag 


ggcagccatc 


cgccatcgtc 


gcgtagggtg 


120 


tcacacccca 


ggaatcgcgt 


cactgaacac 


agcagccggt 


aggacgacca 


tgactgagtt 


180 


ggacaccatc 


gcaaatccgt 


ccgatcccgc 


ggtgcagcgg 


atcatcgatg 


tcaccaagcc 


240 


gtcacgatcc 


aacataaaga 


caacgttgat 


cgaggacgtc 


gagcccctca 


tgcacagcat 


300 


cgcggccggg 


gtggagttca 


tcgaggtcta 


cggcagcgac 


agcagtcctt 


ttccatctga 


360 


gttgctggat 


ctgtgcgggc 


ggcagaacat 


accggtccgc 


ctcatcgact 


cctcgatcgt 


420 


caaccagttg 


ttcaaggggg 


agcggaaggc 


caagacattc 


ggcatcgccc 


gcgtccctcg 


480 


cccggccagg 


ttcggcgata 


tcgcgagccg 


gcgtggggac 


gtcgtcgttc 


tcgacggggt 


540 


gaagatcgtc 


gggaacatcg 


gcgcgatagt 


acgcacgtcg 


ctcgcgctcg 


gagcgtcggg 


600 


gatcatcctg 


gtggacagtg 


acatcaccag 


catcgcggac 


cggcgtctcc 


aaagggccag 


660 


ccgaggttac 


gtcttctccc 


ttcccgtcgt 


tctctccggt 


cgcgaggagg 


ccatcgcctt 


720 


cattcgggac 


agcggtatgc 


agctgatgac 


gctcaaggcg 


gatggcgaca 


tttccgtgaa 


780 


ggaactcggg 


gacaatccgg 


atcggctggc 


cttgctgttc 


ggcagcgaaa 


agggtgggcc 


840 


ttccgacctg 


ttcgaggagg 


cgtcttccgc 


ctcggtttcc 


atccccatga 


tgagccagac 


900 


cgagtctctc 


aacgtttccg 


tttccctcgg 


aatcgcgctg 


cacgagagga 


tcgacaggaa 


960 


tctcgcggcc 


aaccgataag 


cgcctctgtt 


cctcggacgc tcggttcctc gacctcgatt 1020 
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cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 
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gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 
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caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc agtgatggtg atggtgatgt cctcgagatc taagcttgga tccgcggccg 6060 

ctacgtagaa ttcccatggc cgctcccttc tctgacgccg tccacgctgc ctcctcacgt 6120 

gacgtgaggt gcaagcccgg acgttccgcg tgccacgccg tgagccgccg cgtgccgtcg 6180 

gctccctcag cccgggcggc cgtgggagcc cgcctcgata tgtacacccg agaagctccc 6240 
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agcgtcctcc tgggccgcga tactcgacca ccacgcacgc acaccgcact aacgattcgg 6300 

ccggcgctcg attcggccgg cgctcgattc ggccggcgct cgattcggcc ggcgctcgat 6360 

tcggccggcg ctcgattcgg ccgagcagaa gagtgaacaa ccaccgacca cgcttccgct 6420 

ctgcgcgccg tacccgacct acctcccgca gctcgaagca gctcccggga gtaccgccgt 6480 

actcacccgc ctgtgctcac catccaccga cgcaaagccc aacccgagca cacctcttgc 6540 

accaaggtgc cgaccgtggc tttccgctcg cagggttcca gaagaaatcg aacgatccag 6600 

cgcggcaagg ttcaaaaagc aggggttggt ggggaggagg ttttgggggg tgtcgccggg 6660 

atacctgata tggctttgtt ttgcgtagtc gaataatttt ccatatagcc tcggcgcgtc 6720 

ggactcgaat agttgatgtg ggcgggcaca gttgccccat gaaatccgca acggggggcg 6780 

tgctgagcga tcggcaatgg gcggatgcgg tgttgcttcc gcaccggccg ttcgcgacga 6840 

acaacctcca acgaggtcag taccggatga gccgcgacga cgcattggca atgcggtacg 6900 

tcgagcattc accgcacgcg ttgctcggat ctatcgtcat cgactgcgat cacgttgacg 6960 

ccgcgatgcg cgcattcgag caaccatccg accatccggc gccgaactgg gtcgcacaat 7020 

cgccgtccgg ccgcgcacac atcggatggt ggctcggccc caaccacgtg tgccgcaccg 7080 

acagcgcccg actgacgcca ctgcgctacg cccaccgcat cgaaaccggc ctcaagatca 7140 

gcgtcggcgg cgatttcgcg tatggcgggc aactgaccaa aaacccgatt caccccgatt 7200 

gggagacgat ctacggcccg gccaccccgt acacattgcg gcagctggcc accatccaca 7260 

caccccggca gatgccgcgt cggcccgatc gggccgtggg cctgggccgc aacgtcacca 7320 

tgttcgacgc cacccggcga tgggcatacc cgcagtggtg gcaacaccga aacggaaccg 7380 

gccgcgactg ggaccatctc gtcctgcagc actgccacgc cgtcaacacc gagttcacga 7440 

caccactgcc gttcaccgaa gtacgcgcca ccgcgcaatc catctccaaa tggatctggc 7500 

gcaatttcac cgaagaacag taccgagccc gacaagcgca tctcggtcaa aaaggcggca 7560 

aggcaacgac actcgccaaa caagaagccg tccgaaacaa tgcaagaaag tacgacgaac 7620 

atacgatgcg agaggcgatt atctgatggg cggagccaaa aatccggtgc gccgaaagat 7680 

gacggcagca gcagcagccg aaaaattcgg tgcctccact cgcacaatcc aacgcttgtt 7740 

tgctgagccg cgtgacgatt acctcggccg tgcgaaagct cgccgtgaca aagctgtcga 7800 

gctgcggaag caggggttga agtaccggga aatcgccgaa gcgatggaac tctcgaccgg 7860 

gatcgtcggc cgattactgc acgacgcccg caggcacggc gagatttcag cggaggatct 7920 

gtcggcgtaa ccaagtcagc gggttgtcgg gttccggccg gcgctcggca ctcggaccgg 7980 
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ccggcggatg gtgttctgcc tctggcgcag cgtcagctac cgccgaaggc ctgtcatcga 8040 
ccggcttcga ctgaagtatg agcaacgtca cagcctgtga ttggatgatc cgctcacgct 8100 
cgaccgctac ctgttcagct gccgcccgct gggcatgagc aacggccaac tctcgttcaa 8160 

<210> 109 
<211> 8160 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector pTip-CH2 



<400> 109 



gagctcgacc 


gcgcgggtcc 


cggacgggga 


agagcgggga 


gctttgccag 


: agagcgacga 60 


cttccccttg 


cgttggtgat 


tgccggtcag 


ggcagccatc 


cgccatcgtc 


gcgtagggtg 


120 


tcacacccca 


ggaatcgcgt 


cactgaacac 


agcagccggt 


aggacgacca 


tgactgagtt 


180 


ggacaccatc 


gcaaatccgt 


ccgatcccgc 


ggtgcagcgg 


atcatcgatg 


tcaccaagcc 


240 


gtcacgatcc 


aacataaaga 


caacgttgat 


cgaggacgtc 


gagcccctca 


tgcacagcat 


300 


cgcggccggg 


gtggagttca 


tcgaggtcta 


cggcagcgac 


agcagtcctt 


ttccatctga 


360 


gttgctggat 


ctgtgcgggc 


ggcagaacat 


accggtccgc 


ctcatcgact 


cctcgatcgt 


420 


caaccagttg 


ttcaaggggg 


agcggaaggc 


caagacattc 


ggcatcgccc 


gcgtccctcg 


480 


cccggccagg 


ttcggcgata 


tcgcgagccg 


gcgtggggac 


gtcgtcgttc 


tcgacggggt 


540 


gaagatcgtc 


gggaacatcg 


gcgcgatagt 


acgcacgtcg 


ctcgcgctcg 


gagcgtcggg 


600 


gatcatcctg 


gtggacagtg 


acatcaccag 


catcgcggac 


cggcgtctcc 


aaagggccag 


660 


ccgaggttac 


gtcttctccc 


ttcccgtcgt 


tctctccggt 


cgcgaggagg 


ccatcgcctt 


720 


cattcgggac 


agcggtatgc 


agctgatgac 


gctcaaggcg 


gatggcgaca 


tttccgtgaa 


780 


ggaactcggg 


gacaatccgg 


atcggctggc 


cttgctgttc 


ggcagcgaaa 


agggtgggcc 


840 


ttccgacctg 


ttcgaggagg 


cgtcttccgc 


ctcggtttcc 


atccccatga 


tgagccagac 


900 


cgagtctctc 


aacgtttccg 


tttccctcgg 


aatcgcgctg 


cacgagagga 


tcgacaggaa 


960 
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tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 1020 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 
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atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 
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cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc agtgatggtg atggtgatgt cctcgagatc taagcttgga tccgcggccg 6060 

ctacgtagaa ttcccatatg cgctcccttc tctgacgccg tccacgctgc ctcctcacgt 6120 

gacgtgaggt gcaagcccgg acgttccgcg tgccacgccg tgagccgccg cgtgccgtcg 6180 
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gctccctcag cccgggcggc cgtgggagcc cgcctcgata tgtacacccg agaagctccc 6240 

agcgtcctcc tgggccgcga tactcgacca ccacgcacgc acaccgcact aacgattcgg 6300 

ccggcgctcg attcggccgg cgctcgattc ggccggcgct cgattcggcc ggcgctcgat 6360 

tcggccggcg ctcgattcgg ccgagcagaa gagtgaacaa ccaccgacca cgcttccgct 6420 

ctgcgcgccg tacccgacct acctcccgca gctcgaagca gctcccggga gtaccgccgt 6480 

actcacccgc ctgtgctcac catccaccga cgcaaagccc aacccgagca cacctcttgc 6540 

accaaggtgc cgaccgtggc tttccgctcg cagggttcca gaagaaatcg aacgatccag 6600 

cgcggcaagg ttcaaaaagc aggggttggt ggggaggagg ttttgggggg tgtcgccggg 6660 

atacctgata tggctttgtt ttgcgtagtc gaataatttt ccatatagcc tcggcgcgtc 6720 

ggactcgaat agttgatgtg ggcgggcaca gttgccccat gaaatccgca acggggggcg 6780 

tgctgagcga tcggcaatgg gcggatgcgg tgttgcttcc gcaccggccg ttcgcgacga 6840 

acaacctcca acgaggtcag taccggatga gccgcgacga cgcattggca atgcggtacg 6900 

tcgagcattc accgcacgcg ttgctcggat ctatcgtcat cgactgcgat cacgttgacg 6960 

ccgcgatgcg cgcattcgag caaccatccg accatccggc gccgaactgg gtcgcacaat 7020 

■cgccgtccgg ccgcgcacac atcggatggt ggctcggccc caaccacgtg tgccgcaccg 7080 

acagcgcccg actgacgcca ctgcgctacg cccaccgcat cgaaaccggc ctcaagatca 7140 

gcgtcggcgg cgatttcgcg tatggcgggc aactgaccaa aaacccgatt caccccgatt 7200 

gggagacgat ctacggcccg gccaccccgt acacattgcg gcagctggcc accatccaca 7260 

caccccggca gatgccgcgt cggcccgatc gggccgtggg cctgggccgc aacgtcacca 7320 

tgttcgacgc cacccggcga tgggcatacc cgcagtggtg gcaacaccga aacggaaccg 7380 

gccgcgactg ggaccatctc gtcctgcagc actgccacgc cgtcaacacc gagttcacga 7440 

caccactgcc gttcaccgaa gtacgcgcca ccgcgcaatc catctccaaa tggatctggc 7500 

gcaatttcac cgaagaacag taccgagccc gacaagcgca tctcggtcaa aaaggcggca 7560 

aggcaacgac actcgccaaa caagaagccg tccgaaacaa tgcaagaaag tacgacgaac 7620 

atacgatgcg agaggcgatt atctgatggg cggagccaaa aatccggtgc gccgaaagat 7680 

gacggcagca gcagcagccg aaaaattcgg tgcctccact cgcacaatcc aacgcttgtt 7740 

tgctgagccg cgtgacgatt acctcggccg tgcgaaagct cgccgtgaca aagctgtcga 7800 

gctgcggaag caggggttga agtaccggga aatcgccgaa gcgatggaac tctcgaccgg 7860 

gatcgtcggc cgattactgc acgacgcccg caggcacggc gagatttcag cggaggatct 7920 
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gtcggcgtaa ccaagtcagc gggttgtcgg gttccggccg gcgctcggca ctcggaccgg 7980 
ccggcggatg gtgttctgcc tctggcgcag cgtcagctac cgccgaaggc ctgtcatcga 8040 
ccggcttcga ctgaagtatg agcaacgtca cagcctgtga ttggatgatc cgctcacgct 8100 
cgaccgctac ctgttcagct gccgcccgct gggcatgagc aacggccaac tctcgttcaa 8160 

<210> 110 
<211> 8189 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : vector 
pTip-LNHl 



<400> 110 














gagctcgacc 


gcgcgggtcc 


cggacgggga 


agagcgggga 


gctttgccag 


agagcgacga 


I 60 


cttccccttg 


cgttggtgat 


tgccggtcag 


ggcagccatc 


cgccatcgtc 


gcgtagggtg 


120 


tcacacccca 


ggaatcgcgt 


cactgaacac 


agcagccggt 


aggacgacca 


tgactgagtt 


180 


ggacaccatc 


gcaaatccgt 


ccgatcccgc 


ggtgcagcgg 


atcatcgatg 


tcaccaagcc 


240 


gtcacgatcc 


aacataaaga 


caacgttgat 


cgaggacgtc 


gagcccctca 


tgcacagcat 


300 


cgcggccggg 


gtggagttca 


tcgaggtcta 


cggcagcgac 


agcagtcctt 


ttccatctga 


360 


gttgctggat 


ctgtgcgggc 


ggcagaacat 


accggtccgc 


ctcatcgact 


cctcgatcgt 


420 


caaccagttg 


ttcaaggggg 


agcggaaggc 


caagacattc 


ggcatcgccc 


gcgtccctcg 


480 


cccggccagg 


ttcggcgata 


tcgcgagccg 


gcgtggggac 


gtcgtcgttc 


tcgacggggt 


540 


gaagatcgtc 


gggaacatcg 


gcgcgatagt 


acgcacgtcg 


ctcgcgctcg 


gagcgtcggg 


600 


gatcatcctg 


gtggacagtg 


acatcaccag 


catcgcggac 


cggcgtctcc 


aaagggccag 


660 


ccgaggttac 


gtcttctccc 


ttcccgtcgt 


tctctccggt 


cgcgaggagg 


ccatcgcctt 


720 


cattcgggac 


agcggtatgc 


agctgatgac 


gctcaaggcg 


gatggcgaca 


tttccgtgaa 


780 


ggaactcggg 


gacaatccgg 


atcggctggc 


cttgctgttc 


ggcagcgaaa 


agggtgggcc 


840 
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ttccgacctg ttcgaggagg cgtcttccgc ctcggtttcc atccccatga tgagccagac 900 

cgagtctctc aacgtttccg tttccctcgg aatcgcgctg cacgagagga tcgacaggaa 960 

tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 1020 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 
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cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 
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taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc atcctcgaga tctaagcttg gatccgcggc cgctacgtag aattcccata 6060 
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tggtgatggt gatggtggcc catggtatat ctccttctta aagttaaaca aaattatttc 6120 

tagacgccgt ccacgctgcc tcctcacgtg acgtgaggtg caagcccgga cgttccgcgt 6180 

gccacgccgt gagccgccgc gtgccgtcgg ctccctcagc ccgggcggcc gtgggagccc 6240 

gcctcgatat gtacacccga gaagctccca gcgtcctcct gggccgcgat actcgaccac 6300 

cacgcacgca caccgcacta acgattcggc cggcgctcga ttcggccggc gctcgattcg 6360 

gccggcgctc gattcggccg gcgctcgatt cggccggcgc tcgattcggc cgagcagaag 6420 

agtgaacaac caccgaccac gcttccgctc tgcgcgccgt acccgaccta cctcccgcag 6480 

ctcgaagcag ctcccgggag taccgccgta ctcacccgcc tgtgctcacc atccaccgac 6540 

gcaaagccca acccgagcac acctcttgca ccaaggtgcc gaccgtggct ttccgctcgc 6600 

agggttccag aagaaatcga acgatccagc gcggcaaggt tcaaaaagca ggggttggtg 6660 

gggaggaggt tttggggggt gtcgccggga tacctgatat ggctttgttt tgcgtagtcg 6720 

aataattttc catatagcct cggcgcgtcg gactcgaata gttgatgtgg gcgggcacag 6780 

ttgccccatg aaatccgcaa cggggggcgt gctgagcgat cggcaatggg cggatgcggt 6840 

gttgcttccg caccggccgt tcgcgacgaa caacctccaa cgaggtcagt accggatgag 6900 

ccgcgacgac gcattggcaa tgcggtacgt cgagcattca ccgcacgcgt tgctcggatc 6960 

tatcgtcatc gactgcgatc acgttgacgc cgcgatgcgc gcattcgagc aaccatccga 7020 

ccatccggcg ccgaactggg tcgcacaatc gccgtccggc cgcgcacaca tcggatggtg 7080 

gctcggcccc aaccacgtgt gccgcaccga cagcgcccga ctgacgccac tgcgctacgc 7140 

ccaccgcatc gaaaccggcc tcaagatcag cgtcggcggc gatttcgcgt atggcgggca 7200 

actgaccaaa aacccgattc accccgattg ggagacgatc tacggcccgg ccaccccgta 7260 

cacattgcgg cagctggcca ccatccacac accccggcag atgccgcgtc ggcccgatcg 7320 

ggccgtgggc ctgggccgca acgtcaccat gttcgacgcc acccggcgat gggcataccc 7380 

gcagtggtgg caacaccgaa acggaaccgg ccgcgactgg gaccatctcg tcctgcagca 7440 

ctgccacgcc gtcaacaccg agttcacgac accactgccg ttcaccgaag tacgcgccac 7500 

cgcgcaatcc atctccaaat ggatctggcg caatttcacc gaagaacagt accgagcccg 7560 

acaagcgcat ctcggtcaaa aaggcggcaa ggcaacgaca ctcgccaaac aagaagccgt 7620 

ccgaaacaat gcaagaaagt acgacgaaca tacgatgcga gaggcgatta tctgatgggc 7680 

ggagccaaaa atccggtgcg ccgaaagatg acggcagcag cagcagccga aaaattcggt 7740 

gcctccactc gcacaatcca acgcttgttt gctgagccgc gtgacgatta cctcggccgt 7800 
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gcgaaagctc gccgtgacaa agctgtcgag ctgcggaagc aggggttgaa gtaccgggaa 7860 

atcgccgaag cgatggaact ctcgaccggg atcgtcggcc gattactgca cgacgcccgc 7920 

aggcacggcg agatttcagc ggaggatctg tcggcgtaac caagtcagcg ggttgtcggg 7980 

ttccggccgg cgctcggcac tcggaccggc cggcggatgg tgttctgcct ctggcgcagc 8040 

gtcagctacc gccgaaggcc tgtcatcgac cggcttcgac tgaagtatga gcaacgtcac 8100 

agcctgtgat tggatgatcc gctcacgctc gaccgctacc tgttcagctg ccgcccgctg 8160 

ggcatgagca acggccaact ctcgttcaa 8189 

<210> 111 
<211> 8183 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pTip-LNH2 

<400> 111 

gagctcgacc gcgcgggtcc cggacgggga agagcgggga gctttgccag agagcgacga 60 

cttccccttg cgttggtgat tgccggtcag ggcagccatc cgccatcgtc gcgtagggtg 120 

tcacacccca ggaatcgcgt cactgaacac agcagccggt aggacgacca tgactgagtt 180 

ggacaccatc gcaaatccgt ccgatcccgc ggtgcagcgg atcatcgatg tcaccaagcc 240 

gtcacgatcc aacataaaga caacgttgat cgaggacgtc gagcccctca tgcacagcat 300 

cgcggccggg gtggagttca tcgaggtcta cggcagcgac agcagtcctt ttccatctga 360 

gttgctggat ctgtgcgggc ggcagaacat accggtccgc ctcatcgact cctcgatcgt 420 

caaccagttg ttcaaggggg agcggaaggc caagacattc ggcatcgccc gcgtccctcg 480 

cccggccagg ttcggcgata tcgcgagccg gcgtggggac gtcgtcgttc tcgacggggt 540 

gaagatcgtc gggaacatcg gcgcgatagt acgcacgtcg ctcgcgctcg gagcgtcggg 600 

gatcatcctg gtggacagtg acatcaccag catcgcggac cggcgtctcc aaagggccag 660 
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ccgaggttac gtcttctccc ttcccgtcgt tctctccggt cgcgaggagg ccatcgcctt 720 

cattcgggac agcggtatgc agctgatgac gctcaaggcg gatggcgaca tttccgtgaa 780 

ggaactcggg gacaatccgg atcggctggc cttgctgttc ggcagcgaaa agggtgggcc 840 

ttccgacctg ttcgaggagg cgtcttccgc ctcggtttcc atccccatga tgagccagac 900 

cgagtctctc aacgtttccg tttccctcgg aatcgcgctg cacgagagga tcgacaggaa 960 

tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 1020 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 
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tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 
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caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 
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gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc agtgatggtg atggtgatgt cctcgagatc taagcttgga tccgcggccg 6060 

ctacgtagaa ttcccatggt atatctcctt cttaaagtta aacaaaatta tttctagacg 6120 

ccgtccacgc tgcctcctca cgtgacgtga ggtgcaagcc cggacgttcc gcgtgccacg 6180 

ccgtgagccg ccgcgtgccg tcggctccct cagcccgggc ggccgtggga gcccgcctcg 6240 

atatgtacac ccgagaagct cccagcgtcc tcctgggccg cgatactcga ccaccacgca 6300 

cgcacaccgc actaacgatt cggccggcgc tcgattcggc cggcgctcga ttcggccggc 6360 

gctcgattcg gccggcgctc gattcggccg gcgctcgatt cggccgagca gaagagtgaa 6420 

caaccaccga ccacgcttcc gctctgcgcg ccgtacccga cctacctccc gcagctcgaa 6480 

gcagctcccg ggagtaccgc cgtactcacc cgcctgtgct caccatccac cgacgcaaag 6540 

cccaacccga gcacacctct tgcaccaagg tgccgaccgt ggctttccgc tcgcagggtt 6600 

ccagaagaaa tcgaacgatc cagcgcggca aggttcaaaa agcaggggtt ggtggggagg 6660 

aggttttggg gggtgtcgcc gggatacctg atatggcttt gttttgcgta gtcgaataat 6720 

tttccatata gcctcggcgc gtcggactcg aatagttgat gtgggcgggc acagttgccc 6780 

catgaaatcc gcaacggggg gcgtgctgag cgatcggcaa tgggcggatg cggtgttgct 6840 

tccgcaccgg ccgttcgcga cgaacaacct ccaacgaggt cagtaccgga tgagccgcga 6900 

cgacgcattg gcaatgcggt acgtcgagca ttcaccgcac gcgttgctcg gatctatcgt 6960 

catcgactgc gatcacgttg acgccgcgat gcgcgcattc gagcaaccat ccgaccatcc 7020 

ggcgccgaac tgggtcgcac aatcgccgtc cggccgcgca cacatcggat ggtggctcgg 7080 

ccccaaccac gtgtgccgca ccgacagcgc ccgactgacg ccactgcgct acgcccaccg 7140 

catcgaaacc ggcctcaaga tcagcgtcgg cggcgatttc gcgtatggcg ggcaactgac 7200 

caaaaacccg attcaccccg attgggagac gatctacggc ccggccaccc cgtacacatt 7260 

gcggcagctg gccaccatcc acacaccccg gcagatgccg cgtcggcccg atcgggccgt 7320 

gggcctgggc cgcaacgtca ccatgttcga cgccacccgg cgatgggcat acccgcagtg 7380 

gtggcaacac cgaaacggaa ccggccgcga ctgggaccat ctcgtcctgc agcactgcca 7440 

cgccgtcaac accgagttca cgacaccact gccgttcacc gaagtacgcg ccaccgcgca 7500 

atccatctcc aaatggatct ggcgcaattt caccgaagaa cagtaccgag cccgacaagc 7560 

gcatctcggt caaaaaggcg gcaaggcaac gacactcgcc aaacaagaag ccgtccgaaa 7620 
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caatgcaaga aagtacgacg aacatacgat gcgagaggcg attatctgat gggcggagcc 7680 

aaaaatccgg tgcgccgaaa gatgacggca gcagcagcag ccgaaaaatt cggtgcctcc 7740 

actcgcacaa tccaacgctt gtttgctgag ccgcgtgacg attacctcgg ccgtgcgaaa 7800 

gctcgccgtg acaaagctgt cgagctgcgg aagcaggggt tgaagtaccg ggaaatcgcc 7860 

gaagcgatgg aactctcgac cgggatcgtc ggccgattac tgcacgacgc ccgcaggcac 7920 

ggcgagattt cagcggagga tctgtcggcg taaccaagtc agcgggttgt cgggttccgg 7980 

ccggcgctcg gcactcggac cggccggcgg atggtgttct gcctctggcg cagcgtcagc 8040 

taccgccgaa ggcctgtcat cgaccggctt cgactgaagt atgagcaacg tcacagcctg 8100 

tgattggatg atccgctcac gctcgaccgc tacctgttca gctgccgccc gctgggcatg 8160 

agcaacggcc aactctcgtt caa 8183 

<210> 112 
<211> 8123 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : vector 
pTip-LCHl 

<400> 112 

gagctcgacc gcgcgggtcc cggacgggga agagcgggga gctttgccag agagcgacga 60 

cttccccttg cgttggtgat tgccggtcag ggcagccatc cgccatcgtc gcgtagggtg 120 

tcacacccca ggaatcgcgt cactgaacac agcagccggt aggacgacca tgactgagtt 180 

ggacaccatc gcaaatccgt ccgatcccgc ggtgcagcgg atcatcgatg tcaccaagcc 240 

gtcacgatcc aacataaaga caacgttgat cgaggacgtc gagcccctca tgcacagcat 300 

cgcggccggg gtggagttca tcgaggtcta cggcagcgac agcagtcett ttccatctga 360 

gttgctggat ctgtgcgggc ggcagaacat accggtccgc ctcatcgact cctcgatcgt 420 

cccggccagg ttcggcgata tcgcgagccg gcgtggggac gtcgtcgttc tcgacggggt 480 
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gaagatcgtc gggaacatcg gcgcgatagt acgcacgtcg ctcgcgctcg gagcgtcggg 540 

gatcatcctg gtggacagtg acatcaccag catcgcggac cggcgtctcc aaagggccag 600 

ccgaggttac gtcttctccc ttcccgtcgt tctctccggt cgcgaggagg ccatcgcctt 660 

cattcgggac agcggtatgc agctgatgac gctcaaggcg gatggcgaca tttccgtgaa 720 

ggaactcggg gacaatccgg atcggctggc cttgctgttc ggcagcgaaa agggtgggcc 780 

ttccgacctg ttcgaggagg cgtcttccgc ctcggtttcc atccccatga tgagccagac 840 

cgagtctctc aacgtttccg tttccctcgg aatcgcgctg cacgagagga tcgacaggaa 900 

tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 960 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1020 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1080 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1140 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1200 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1260 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1320 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1380 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1440 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1500 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1560 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1620 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1680 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1740 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1800 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1860 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1920 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 1980 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2040 

cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2100 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2160 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2220 
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gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2280 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2340 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2400 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2460 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2520 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2580 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2640 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2700 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2760 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2820 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2880 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 2940 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3000 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3060 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3120 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3180 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3240 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3300 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3360 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3420 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3480 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3540 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3600 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3660 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3720 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3780 

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3840 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3900 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 3960 
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cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4020 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4080 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4140 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4200 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4260 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4320 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4380 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4440 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4500 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4560 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4620 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4680 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4740 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4800 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4860 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4920 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 4980 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5040 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5100 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5160 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5220 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5280 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5340 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5400 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5460 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5520 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5580 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5640 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5700 
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gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5760 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5820 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5880 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 5940 

cgactagttc agtgatggtg atggtgatgt cctcgagatc taagcttgga tccgcggccg 6000 

ctacgtagaa ttcccatggt atatctcctt cttaaagtta aacaaaatta tttctagacg 6060 

ccgtccacgc tgcctcctca cgtgacgtga ggtgcaagcc cggacgttcc gcgtgccacg 6120 

ccgtgagccg ccgcgtgccg tcggctccct cagcccgggc ggccgtggga gcccgcctcg 6180 

atatgtacac ccgagaagct cccagcgtcc tcctgggccg cgatactcga ccaccacgca 6240 

cgcacaccgc actaacgatt cggccggcgc tcgattcggc cggcgctcga ttcggccggc 6300 

gctcgattcg gccggcgctc gattcggccg gcgctcgatt cggccgagca gaagagtgaa 6360 

caaccaccga ccacgcttcc gctctgcgcg ccgtacccga cctacctccc gcagctcgaa 6420 

gcagctcccg ggagtaccgc cgtactcacc cgcctgtgct caccatccac cgacgcaaag 6480 

cccaacccga gcacacctct tgcaccaagg tgccgaccgt ggctttccgc tcgcagggtt 6540 

ccagaagaaa tcgaacgatc cagcgcggca aggttcaaaa agcaggggtt ggtggggagg 6600 

aggttttggg gggtgtcgcc gggatacctg atatggcttt gttttgcgta gtcgaataat 6660 

tttccatata gcctcggcgc gtcggactcg aatagttgat gtgggcgggc acagttgccc 6720 

catgaaatcc gcaacggggg gcgtgctgag cgatcggcaa tgggcggatg cggtgttgct 6780 

tccgcaccgg ccgttcgcga cgaacaacct ccaacgaggt cagtaccgga tgagccgcga 6840 

cgacgcattg gcaatgcggt acgtcgagca ttcaccgcac gcgttgctcg gatctatcgt 6900 

catcgactgc gatcacgttg acgccgcgat gcgcgcattc gagcaaccat ccgaccatcc 6960 

ggcgccgaac tgggtcgcac aatcgccgtc cggccgcgca cacatcggat ggtggctcgg 7020 

ccccaaccac gtgtgccgca ccgacagcgc ccgactgacg ccactgcgct acgcccaccg 7080 

catcgaaacc ggcctcaaga tcagcgtcgg cggcgatttc gcgtatggcg ggcaactgac 7140 

caaaaacccg attcaccccg attgggagac gatctacggc ccggccaccc cgtacacatt 7200 

gcggcagctg gccaccatcc acacaccccg gcagatgccg cgtcggcccg atcgggccgt 7260 

gggcctgggc cgcaacgtca ccatgttcga cgccacccgg cgatgggcat acccgcagtg 7320 

gtggcaacac cgaaacggaa ccggccgcga ctgggaccat ctcgtcctgc agcactgcca 7380 

cgccgtcaac accgagttca cgacaccact gccgttcacc gaagtacgcg ccaccgcgca 7440 
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atccatctcc aaatggatct ggcgcaattt 

gcatctcggt caaaaaggcg gcaaggcaac 

caatgcaaga aagtacgacg aacatacgat 

aaaaatccgg tgcgccgaaa gatgacggca 

actcgcacaa tccaacgctt gtttgctgag 

gctcgccgtg acaaagctgt cgagctgcgg 

gaagcgatgg aactctcgac cgggatcgtc 

ggcgagattt cagcggagga tctgtcggcg 

ccggcgctcg gcactcggac cggccggcgg 

taccgccgaa ggcctgtcat cgaccggctt 

tgattggatg atccgctcac gctcgaccgc 

agcaacggcc aactctcgtt caa 



caccgaagaa cagtaccgag cccgacaagc 7500 

gacactcgcc aaacaagaag ccgtccgaaa 7560 

gcgagaggcg attatctgat gggcggagcc 7620 

gcagcagcag ccgaaaaatt cggtgcctcc 7680 

ccgcgtgacg attacctcgg ccgtgcgaaa 7740 

aagcaggggt tgaagtaccg ggaaatcgcc 7800 

ggccgattac tgcacgacgc ccgcaggcac 7860 

taaccaagtc agcgggttgt cgggttccgg 7920 

atggtgttct gcctctggcg cagcgtcagc 7980 

cgactgaagt atgagcaacg tcacagcctg 8040 

tacctgttca gctgccgccc gctgggcatg 8100 
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<400> 113 

gagctcgacc gcgcgggtcc cggacgggga agagcgggga gctttgccag agagcgacga 60 

cttccccttg cgttggtgat tgccggtcag ggcagccatc cgccatcgtc gcgtagggtg 120 

tcacacccca ggaatcgcgt cactgaacac agcagccggt aggacgacca tgactgagtt 180 

ggacaccatc gcaaatccgt ccgatcccgc ggtgcagcgg atcatcgatg tcaccaagcc 240 

gtcacgatcc aacataaaga caacgttgat cgaggacgtc gagcccctca tgcacagcat 300 

cgcggccggg gtggagttca tcgaggtcta cggcagcgac agcagtcctt ttccatctga 360 
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gttgctggat ctgtgcgggc ggcagaacat accggtccgc ctcatcgact cctcgatcgt 420 

caaccagttg ttcaaggggg agcggaaggc caagacattc ggcatcgccc gcgtccctcg 480 

cccggccagg ttcggcgata tcgcgagccg gcgtggggac gtcgtcgttc tcgacggggt 540 

gaagatcgtc gggaacatcg gcgcgatagt acgcacgtcg ctcgcgctcg gagcgtcggg 600 

gatcatcctg gtggacagtg acatcaccag catcgcggac cggcgtctcc aaagggccag 660 

ccgaggttac gtcttctccc ttcccgtcgt tctctccggt cgcgaggagg ccatcgcctt 720 

cattcgggac agcggtatgc agctgatgac gctcaaggcg gatggcgaca tttccgtgaa 780 

ggaactcggg gacaatccgg atcggctggc cttgctgttc ggcagcgaaa agggtgggcc 840 

ttccgacctg ttcgaggagg cgtcttccgc ctcggtttcc atccccatga tgagccagac 900 

cgagtctctc aacgtttccg tttccctcgg aatcgcgctg cacgagagga tcgacaggaa 960 

tctcgcggcc aaccgataag cgcctctgtt cctcggacgc tcggttcctc gacctcgatt 1020 

cgtcagtgat gatcacctca cacggcagcg atcaccactg acatatcgag gtcaacggtc 1080 

gtggtccggg cgggcactcc tcgaaggcgc ggccgacgcc cttgaacgac tcgatgactc 1140 

tagagtaacg ggctactccg tttaacggac cccgttctca cgctttaggc ttgaccccgg 1200 

agcctgcatg gggcattccg ccgtgaaccc ggtggaatgc ccccggcacc cgggctttcc 1260 

agcaaagatc acctggcgcc gatgagtaag gcgtacagaa ccactccaca ggaggaccgt 1320 

cgagatgaaa tctaacaatg cgctcatcgt catcctcggc accgtcaccc tggatgctgt 1380 

aggcataggc ttggttatgc cggtactgcc gggcctcttg cgggatatcg tccattccga 1440 

cagcatcgcc agtcactatg gcgtgctgct agcgctatat gcgttgatgc aatttctatg 1500 

cgcacccgtt ctcggagcac tgtccgaccg ctttggccgc cgcccagtcc tgctcgcttc 1560 

gctacttgga gccactatcg actacgcgat catggcgacc acacccgtcc tgtggattct 1620 

ctacgccgga cgcatcgtgg ccggcatcac cggcgccaca ggtgcggttg ctggcgccta 1680 

tatcgccgac atcaccgatg gggaagatcg ggctcgccac ttcgggctca tgagcgcttg 1740 

tttcggcgtg ggtatggtgg caggccccgt ggccggggga ctgttgggcg ccatctcctt 1800 

gcatgcacca ttccttgcgg cggcggtgct caacggcctc aacctactac tgggctgctt 1860 

cctaatgcag gagtcgcata agggagagcg tcgtccgatg cccttgagag ccttcaaccc 1920 

agtcagctcc ttccggtggg cgcggggcat gactatcgtc gccgcactta tgactgtctt 1980 

ctttatcatg caactcgtag gacaggtgcc ggcagcgctc tgggtcattt tcggcgagga 2040 

ccgctttcgc tggagcgcga cgatgatcgg cctgtcgctt gcggtattcg gaatcttgca 2100 
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cgccctcgct caagccttcg tcactggtcc cgccaccaaa cgtttcggcg agaagcaggc 2160 

cattatcgcc ggcatggcgg ccgacgcgct gggctacgtc ttgctggcgt tcgcgacgcg 2220 

aggctggatg gccttcccca ttatgattct tctcgcttcc ggcggcatcg ggatgcccgc 2280 

gttgcaggcc atgctgtcca ggcaggtaga tgacgaccat cagggacagc ttcaaggatc 2340 

gctcgcggct cttaccagcc taacttcgat cattggaccg ctgatcgtca cggcgattta 2400 

tgccgcctcg gcgagcacat ggaacgggtt ggcatggatt gtaggcgccg ccctatacct 2460 

tgtctgcctc cccgcgttgc gtcgcggtgc atggagccgg gccacctcga cctgaatgga 2520 

agccggcggc acctcgctaa cggattcacc actccaagaa ttggagccaa tcaattcttg 2580 

cggagaactg tgaatgcgca aaccaaccct tggcagaaca tatccatcgc gtccgccatc 2640 

tccagcagcc gcacgcggcg catctcgggc agcgttgggt cctggccacg ggtgcgcatg 2700 

atcgtgctcc tgtcgttgag gactagaatt gatctcctcg accgccaatt gggcatctga 2760 

gaatcatctg cgtttctcgc acgcaacgta cttgcaacgt tgcaactcct agtgttgtga 2820 

atcacacccc accggggggt gggattgcag tcaccgattt ggtgggtgcg cccaggaaga 2880 

tcacgtttac ataggagctt gcaatgagct actccgtggg acaggtggcc ggcttcgccg 2940 

gagtgacggt gcgcacgctg caccactacg acgacatcgg cctgctcgta ccgagcgagc 3000 

gcagccacgc gggccaccgg cgctacagcg acgccgacct cgaccggctg cagcagatcc 3060 

tgttctaccg ggagctgggc ttcccgctcg acgaggtcgc cgccctgctc gacgacccgg 3120 

ccgcggaccc gcgcgcgcac ctgcgccgcc agcacgagct gctgtccgcc cggatcggga 3180 

aactgcagaa gatggcggcg gccgtggagc aggcgatgga ggcacgcagc atgggaatca 3240 

acctcacccc ggaggagaag ttcgaggtct tcggcgactt cgaccccgac cagtacgagg 3300 

aggaggtccg ggaacgctgg gggaacaccg acgcctaccg ccagtccaag gagaagaccg 3360 

cctcgtacac caaggaggac tggcagcgca tccaggacga ggccgacgag ctcacccggc 3420 

gcttcgtcgc cctgatggac gcgggtgagc ccgccgactc cgagggggcg atggacgccg 3480 

ccgaggacca ccggcagggc atcgcccgca accactacga ctgcgggtac gagatgcaca 3540 

cctgcctggg cgagatgtac gtgtccgacg aacgtttcac gcgaaacatc gacgccgcca 3600 

agccgggcct cgccgcctac atgcgcgacg cgatcctcgc caacgccgtc cggcacaccc 3660 

cctgagcggt ggtcgtggcc cgggtctccc gcccggtctc accccacggc tcactcccgg 3720 

gccacgacca ccgccgtccc gtacgcgcac acctcggtgc ccacgtccgc cgcctccgtc 3780 

acgtcgaaac ggaagatccc cgggtaccga gctcgtcagg tggcactttt cggggaaatg 3840 
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tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 3900 

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 3960 

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 4020 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 4080 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 4140 

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 4200 

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 4260 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 4320 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 4380 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 4440 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 4500 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 4560 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 4620 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 4680 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 4740 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 4800 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 4860 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 4920 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt 4980 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 5040 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 5100 

gcagagcgca gataccaaat actgttcttc tagtgtagcc gtagttaggc caccacttca 5160 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg 5220 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 5280 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 5340 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 5400 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 5460 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 5520 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 5580 
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cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 5640 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 5700 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcccaatac 5760 

gcaaaccgcc tctccccgcg cgttggccga ttcattaatg cagctggcac gactagagtc 5820 

ccgctgaggc ggcgtagcag gtcagccgcc ccagcggtgg tcaccaaccg gggtggaacg 5880 

gcgccggtat cgggtgtgtc cgtggcgctc attccaacct ccgtgtgttt gtgcaggttt 5940 

cgcgtgttgc agtccctcgc accggcaccc gcagcgaggg gctcacgggt gccggtgggt 6000 

cgactagttc agtgatggtg atggtgatgt cctcgagatc taagcttgga tccgcggccg 6060 

ctacgtagaa ttcccatatg tatatctcct tcttaaagtt aaacaaaatt atttctagac 6120 

gccgtccacg ctgcctcctc acgtgacgtg aggtgcaagc ccggacgttc cgcgtgccac 6180 

gccgtgagcc gccgcgtgcc gtcggctccc tcagcccggg cggccgtggg agcccgcctc 6240 

gatatgtaca cccgagaagc tcccagcgtc ctcctgggcc gcgatactcg accaccacgc 6300 

acgcacaccg cactaacgat tcggccggcg ctcgattcgg ccggcgctcg attcggccgg 6360 

cgctcgattc ggccggcgct cgattcggcc ggcgctcgat tcggccgagc agaagagtga 6420 

acaaccaccg accacgcttc cgctctgcgc gccgtacccg acctacctcc cgcagctcga 6480 

agcagctccc gggagtaccg ccgtactcac ccgcctgtgc tcaccatcca ccgacgcaaa 6540 

gcccaacccg agcacacctc ttgcaccaag gtgccgaccg tggctttccg ctcgcagggt 6600 

tccagaagaa atcgaacgat ccagcgcggc aaggttcaaa aagcaggggt tggtggggag 6660 

gaggttttgg ggggtgtcgc cgggatacct gatatggctt tgttttgcgt agtcgaataa 6720 

ttttccatat agcctcggcg cgtcggactc gaatagttga tgtgggcggg cacagttgcc 6780 

ccatgaaatc cgcaacgggg ggcgtgctga gcgatcggca atgggcggat gcggtgttgc 6840 

ttccgcaccg gccgttcgcg acgaacaacc tccaacgagg tcagtaccgg atgagccgcg 6900 

acgacgcatt ggcaatgcgg tacgtcgagc attcaccgca cgcgttgctc ggatctatcg 6960 

tcatcgactg cgatcacgtt gacgccgcga tgcgcgcatt cgagcaacca tccgaccatc 7020 

cggcgccgaa ctgggtcgca caatcgccgt ccggccgcgc acacatcgga tggtggctcg 7080 

gccccaacca cgtgtgccgc accgacagcg cccgactgac gccactgcgc tacgcccacc 7140 

gcatcgaaac cggcctcaag atcagcgtcg gcggcgattt cgcgtatggc gggcaactga 7200 

ccaaaaaccc gattcacccc gattgggaga cgatctacgg cccggccacc ccgtacacat 7260 

tgcggcagct ggccaccatc cacacacccc ggcagatgcc gcgtcggccc gatcgggccg 7320 
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Fig. 8 
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Fig. 1 1 
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Fig. 19 
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Fig. 20 



Name of plas 


Functionally known protein (two or more obtained) 




. Type 
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Number 

16 


LE20 


Serum amyloid A (Saal) 


M11131 




6 


L113 


NADH dehydorogenase 1 Of 4 


BC011114 
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LE59 


Pantotenate kinase 1 B 


AF200357 




2 


LE94 


Retinol bindins protein 4 (RBP4) 


AK008765 




2 


LE98 


Major urinary Drotein 4 like 


BC019965 
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LE287 


Histidine-rich glycoprotein 


NM 053176 




o 




Functionally unknown protein (singly obtained) 




24 


24 


L3 


Cytochrome b5 like 


AK002426 






LE2 


Fibrinogen A alpha 


BC005467 






LE3 


Clusterin 


NM_013492 






LE9 


SDlicinfiT factor 3h suhiinit 1 1 RRLf^a 


NM_031179 




^ . 
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LEI 2 


Hfl nt A orl nhtn 


NM_017370 






LEI 8 




80019578 
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^*^ter~alpha~trypsin inhibitor Heavy Chain 2 
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ocrum aiDumin 
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/^ryidcaLaLc Qsacexyiase 
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i^ew cuiNM, nigniy similar to UUr'~Giycosyltransferase 
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RIKEN 130001 7 J02, Highly similar to Transferrin 
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Phosphatidytinositol 3-kinase 


NM 008839 







LE178 


Protein kinase C receptor (RACK1) like 
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LE204 


EGF receptor 


AF275367 
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Rettnoic acid receptor responsive protein TIG2 
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Insulin-like growth factor lA 


X04480 






1 coon 


Transferrin 


BC022986 






LE295 


Apolipoprotein A-V 


NM_080434 






LE305 


Fatty Acid Binding Protein 1 (FABP1) 


BC009812 






LE354 


Retinoblastoma binding protein 7 (Rbbp7) 


NM_009031 






LE357 


Zinc fingers and homeoboxes protein 1 (Zhxl) 


NM_009572 
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Tumor differentially expressed 1 like (Tdell) 


NM_019760 






LE421 


RIKEN1300006C19. Highly similar to OSTSTT3 
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Functionally unknown protein 
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LE25 


IMAGE:4239007. DUF92 like membrane Protein? 


3001 6895 






LE51 


New cONA. No homology 








LE1 19 
LEI 23 


IMAGE:3489640. Bone marrow stromal protein like^ j 
R1KEN1500015G18. No homology | 


30008532 
SIM_025439 
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